Cross-validation: what does it estimate and how well does it do it?

Stephen Bates; Trevor Hastie; Robert Tibshirani

arXiv:2104.00673·stat.ME·March 12, 2024·45 cites

Cross-validation: what does it estimate and how well does it do it?

Stephen Bates, Trevor Hastie, Robert Tibshirani

PDF

Open Access 2 Repos

TL;DR

This paper investigates the true nature of cross-validation error estimates, revealing they measure the error of models trained on different data sets, not the same data, and proposes a nested scheme for better confidence intervals.

Contribution

It demonstrates that cross-validation estimates the prediction error of models trained on other datasets, not the same, and introduces a nested cross-validation method for more accurate confidence intervals.

Findings

01

Cross-validation estimates the error of models trained on different datasets.

02

Standard confidence intervals often have lower than desired coverage.

03

Nested cross-validation improves the accuracy of variance estimation.

Abstract

Cross-validation is a widely-used technique to estimate prediction error, but its behavior is complex and not fully understood. Ideally, one would like to think that cross-validation estimates the prediction error for the model at hand, fit to the training data. We prove that this is not the case for the linear model fit by ordinary least squares; rather it estimates the average prediction error of models fit on other unseen training sets drawn from the same population. We further show that this phenomenon occurs for most popular estimates of prediction error, including data splitting, bootstrapping, and Mallow's Cp. Next, the standard confidence intervals for prediction error derived from cross-validation may have coverage far below the desired level. Because each data point is used for both training and testing, there are correlations among the measured accuracies for each fold, and…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsStatistical Methods and Inference · Statistical Methods and Bayesian Inference · Advanced Statistical Methods and Models