Optimal Hold-Out Size in Cross-Validation

Kenichiro McAlinn; K\=osaku Takanashi

arXiv:2511.12698·stat.ME·February 3, 2026

Optimal Hold-Out Size in Cross-Validation

Kenichiro McAlinn, K\=osaku Takanashi

PDF

Open Access

TL;DR

This paper develops a principled method for selecting the optimal hold-out size in cross-validation by balancing training accuracy and evaluation uncertainty, improving model comparison and scientific inference.

Contribution

It introduces a utility-based rule for choosing the hold-out size based on irreducible noise, formalizing the tradeoff and providing data-dependent, assumption-aware guidance.

Findings

01

Optimal K depends on data and model complexity.

02

Choice of K varies with irreducible error assumptions.

03

Impacts on scientific conclusions and model comparisons.

Abstract

Cross-validation (CV) is routinely used across the sciences to select models and tune parameters, and the resulting choices are often interpreted as substantive scientific conclusions (e.g., which variables, mechanisms, or risk factors are ``supported by the data''). A key part of the CV procedure -- the hold-out size, or equivalently the fold count $K$ -- is typically set by convention (e.g., 80/20, $K = 5$ ) rather than by a principled criterion. Central to the issue is the tradeoff between training and testing: increasing the training sample size improves model accuracy, while sacrificing certainty around the accuracy itself. We formalize the tradeoff by targeting predictive performance and explicitly penalizing evaluation uncertainty, which cannot be identified from the data without additional assumptions. We derive finite-sample expressions of this evaluation uncertainty under…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsStatistical Methods and Inference · Statistical Methods in Clinical Trials · Genetic Associations and Epidemiology