Choice of V for V-Fold Cross-Validation in Least-Squares Density Estimation
Sylvain Arlot (SIERRA, DI-ENS), Matthieu Lerasle (JAD)

TL;DR
This paper provides theoretical insights into choosing the number of folds V in V-fold cross-validation for least-squares density estimation, showing how variance and performance depend on V and supporting the common choice of V=5.
Contribution
It establishes a non-asymptotic oracle inequality for V-fold cross-validation and its bias-corrected version, and derives variance formulas explaining the optimal V choice.
Findings
Variance of V-fold CV depends on V as 1+4/(V-1).
Performance improves significantly from V=2 to V=5 or 10.
V=5 is supported as a practical choice based on theoretical and simulation results.
Abstract
This paper studies V-fold cross-validation for model selection in least-squares density estimation. The goal is to provide theoretical grounds for choosing V in order to minimize the least-squares loss of the selected estimator. We first prove a non-asymptotic oracle inequality for V-fold cross-validation and its bias-corrected version (V-fold penalization). In particular, this result implies that V-fold penalization is asymptotically optimal in the nonparametric case. Then, we compute the variance of V-fold cross-validation and related criteria, as well as the variance of key quantities for model selection performance. We show that these variances depend on V like 1+4/(V-1), at least in some particular cases, suggesting that the performance increases much from V=2 to V=5 or 10, and then is almost constant. Overall, this can explain the common advice to take V=5---at least in our…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsStatistical Methods and Inference · Advanced Statistical Methods and Models · Optimal Experimental Design Methods
