The Structure of Cross-Validation Error: Stability, Covariance, and Minimax Limits
Ido Nachum, R\"udiger Urbanke, and Thomas Weinberger

TL;DR
This paper provides a theoretical analysis of cross-validation error, revealing fundamental limits on its accuracy and how properties like stability influence the optimal number of folds.
Contribution
It introduces a novel decomposition of CV error, a weaker stability notion, and establishes minimax bounds showing inherent trade-offs in CV performance.
Findings
Minimax lower bound of (((k^*)/n)) for CV error.
CV cannot achieve the ideal 1/n error rate for large k.
Trade-off between CV accuracy and number of folds k.
Abstract
Despite ongoing theoretical research on cross-validation (CV), many theoretical questions remain widely open. This motivates our investigation into how properties of algorithm-distribution pairs can affect the choice for the number of folds in -fold CV. Our results consist of a novel decomposition of the mean-squared error of cross-validation for risk estimation, which explicitly captures the correlations of error estimates across overlapping folds and includes a novel algorithmic stability notion, squared loss stability, that is considerably weaker than the typically required hypothesis stability in other comparable works. Furthermore, we prove: 1. For any learning algorithm that minimizes empirical risk, the mean-squared error of the -fold cross-validation estimator of the population risk satisfies the following minimax lower…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdversarial Robustness in Machine Learning · Machine Learning and Algorithms · Stochastic Gradient Optimization Techniques
