Subsampling bias and the best-discrepancy systematic cross validation

Statistical machine learning models should be evaluated and validated before putting to work. Conventional k-fold Monte Carlo Cross-Validation (MCCV) procedure uses a pseudo-random sequence to partition instances into k subsets, which usually causes subsampling bias, inflates generalization errors a...

Full description

Saved in:
Bibliographic Details
Main Authors: Liang Guo, Jianya Liu, Ruodan Lu
Format: Default Article
Published: 2019
Subjects:
Online Access:https://hdl.handle.net/2134/38228
Tags: Add Tag
No Tags, Be the first to tag this record!