Optimal Hold-Out Size in Cross-Validation
Kenichiro McAlinn, K\=osaku Takanashi

TL;DR
This paper develops a principled method for selecting the optimal hold-out size in cross-validation by balancing training accuracy and evaluation uncertainty, improving model comparison and scientific inference.
Contribution
It introduces a utility-based rule for choosing the hold-out size based on irreducible noise, formalizing the tradeoff and providing data-dependent, assumption-aware guidance.
Findings
Optimal K depends on data and model complexity.
Choice of K varies with irreducible error assumptions.
Impacts on scientific conclusions and model comparisons.
Abstract
Cross-validation (CV) is routinely used across the sciences to select models and tune parameters, and the resulting choices are often interpreted as substantive scientific conclusions (e.g., which variables, mechanisms, or risk factors are ``supported by the data''). A key part of the CV procedure -- the hold-out size, or equivalently the fold count -- is typically set by convention (e.g., 80/20, ) rather than by a principled criterion. Central to the issue is the tradeoff between training and testing: increasing the training sample size improves model accuracy, while sacrificing certainty around the accuracy itself. We formalize the tradeoff by targeting predictive performance and explicitly penalizing evaluation uncertainty, which cannot be identified from the data without additional assumptions. We derive finite-sample expressions of this evaluation uncertainty under…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsStatistical Methods and Inference · Statistical Methods in Clinical Trials · Genetic Associations and Epidemiology
