Conditional predictive inference for stable algorithms
Lukas Steinberger, Hannes Leeb

TL;DR
This paper develops prediction intervals based on k-fold cross validation that are conditionally valid given the training data, especially effective with stable algorithms and high-dimensional data, providing reliable predictive inference.
Contribution
It introduces a finite sample analysis of cross validation residuals ensuring conditional coverage, applicable in high-dimensional, non-parametric settings with minimal assumptions.
Findings
Prediction intervals are close to nominal coverage under stability.
Results hold in high-dimensional linear models with increasing sample size and dimension.
Cross validation can provide reliable predictive inference even when parameter estimation fails.
Abstract
We investigate generically applicable and intuitively appealing prediction intervals based on -fold cross validation. We focus on the conditional coverage probability of the proposed intervals, given the observations in the training sample (hence, training conditional validity), and show that it is close to the nominal level, in an appropriate sense, provided that the underlying algorithm used for computing point predictions is sufficiently stable when feature-response pairs are omitted. Our results are based on a finite sample analysis of the empirical distribution function of -fold cross validation residuals and hold in non-parametric settings with only minimal assumptions on the error distribution. To illustrate our results, we also apply them to high-dimensional linear predictors, where we obtain uniform asymptotic training conditional validity as both sample size and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsStatistical Methods and Inference · Probabilistic and Robust Engineering Design · Machine Learning and Algorithms
