The Structure of Cross-Validation Error: Stability, Covariance, and Minimax Limits

Ido Nachum; R\"udiger Urbanke; and Thomas Weinberger

arXiv:2511.03554·math.ST·January 9, 2026

The Structure of Cross-Validation Error: Stability, Covariance, and Minimax Limits

Ido Nachum, R\"udiger Urbanke, and Thomas Weinberger

PDF

Open Access

TL;DR

This paper provides a theoretical analysis of cross-validation error, revealing fundamental limits on its accuracy and how properties like stability influence the optimal number of folds.

Contribution

It introduces a novel decomposition of CV error, a weaker stability notion, and establishes minimax bounds showing inherent trade-offs in CV performance.

Findings

01

Minimax lower bound of (((k^*)/n)) for CV error.

02

CV cannot achieve the ideal 1/n error rate for large k.

03

Trade-off between CV accuracy and number of folds k.

Abstract

Despite ongoing theoretical research on cross-validation (CV), many theoretical questions remain widely open. This motivates our investigation into how properties of algorithm-distribution pairs can affect the choice for the number of folds in $k$ -fold CV. Our results consist of a novel decomposition of the mean-squared error of cross-validation for risk estimation, which explicitly captures the correlations of error estimates across overlapping folds and includes a novel algorithmic stability notion, squared loss stability, that is considerably weaker than the typically required hypothesis stability in other comparable works. Furthermore, we prove: 1. For any learning algorithm that minimizes empirical risk, the mean-squared error of the $k$ -fold cross-validation estimator $L_{CV}^{(k)}$ of the population risk $L_{D}$ satisfies the following minimax lower…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdversarial Robustness in Machine Learning · Machine Learning and Algorithms · Stochastic Gradient Optimization Techniques