Concentration inequalities of the cross-validation estimator for   Empirical Risk Minimiser

Matthieu Cornec

arXiv:1011.0096·stat.ML·November 2, 2010·2 cites

Concentration inequalities of the cross-validation estimator for Empirical Risk Minimiser

Matthieu Cornec

PDF

Open Access

TL;DR

This paper establishes concentration inequalities for cross-validation estimates of generalization error in empirical risk minimizers, covering various procedures and providing insights on their consistency and convergence rates.

Contribution

It introduces general bounds for cross-validation error estimates applicable to diverse procedures and predictor classes, highlighting their consistency without requiring test sample size to grow.

Findings

01

Provides bounds showing worst-case error close to training error

02

Analyzes convergence rates of different cross-validation methods

03

Offers guidelines for choosing cross-validation procedures

Abstract

In this article, we derive concentration inequalities for the cross-validation estimate of the generalization error for empirical risk minimizers. In the general setting, we prove sanity-check bounds in the spirit of \cite{KR99} \textquotedblleft\textit{bounds showing that the worst-case error of this estimate is not much worse that of training error estimate} \textquotedblright . General loss functions and class of predictors with finite VC-dimension are considered. We closely follow the formalism introduced by \cite{DUD03} to cover a large variety of cross-validation procedures including leave-one-out cross-validation, $k$ % -fold cross-validation, hold-out cross-validation (or split sample), and the leave- $υ$ -out cross-validation. In particular, we focus on proving the consistency of the various cross-validation procedures. We point out the interest of each cross-validation…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMachine Learning and Algorithms · Sparse and Compressive Sensing Techniques · Stochastic Gradient Optimization Techniques