Concentration inequalities of the cross-validation estimator for Empirical Risk Minimiser
Matthieu Cornec

TL;DR
This paper establishes concentration inequalities for cross-validation estimates of generalization error in empirical risk minimizers, covering various procedures and providing insights on their consistency and convergence rates.
Contribution
It introduces general bounds for cross-validation error estimates applicable to diverse procedures and predictor classes, highlighting their consistency without requiring test sample size to grow.
Findings
Provides bounds showing worst-case error close to training error
Analyzes convergence rates of different cross-validation methods
Offers guidelines for choosing cross-validation procedures
Abstract
In this article, we derive concentration inequalities for the cross-validation estimate of the generalization error for empirical risk minimizers. In the general setting, we prove sanity-check bounds in the spirit of \cite{KR99} \textquotedblleft\textit{bounds showing that the worst-case error of this estimate is not much worse that of training error estimate} \textquotedblright . General loss functions and class of predictors with finite VC-dimension are considered. We closely follow the formalism introduced by \cite{DUD03} to cover a large variety of cross-validation procedures including leave-one-out cross-validation, % -fold cross-validation, hold-out cross-validation (or split sample), and the leave--out cross-validation. In particular, we focus on proving the consistency of the various cross-validation procedures. We point out the interest of each cross-validation…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning and Algorithms · Sparse and Compressive Sensing Techniques · Stochastic Gradient Optimization Techniques
