Estimation Stability with Cross Validation (ESCV)
Chinghway Lim, Bin Yu

TL;DR
This paper introduces ESCV, a new model selection criterion based on estimation stability, which improves model interpretability and reduces false positives compared to traditional cross-validation in high-dimensional sparse modeling.
Contribution
ESCV provides a model-free, stability-based criterion for regularization parameter selection that outperforms CV in false positive control and model plausibility.
Findings
ESCV reduces false positive rates significantly.
ESCV models are smaller and more plausible.
ESCV maintains similar prediction performance as CV.
Abstract
Cross-validation (CV) is often used to select the regularization parameter in high dimensional problems. However, when applied to the sparse modeling method Lasso, CV leads to models that are unstable in high-dimensions, and consequently not suited for reliable interpretation. In this paper, we propose a model-free criterion ESCV based on a new estimation stability (ES) metric and CV. Our proposed ESCV finds a locally ES-optimal model smaller than the CV choice so that the it fits the data and also enjoys estimation stability property. We demonstrate that ESCV is an effective alternative to CV at a similar easily parallelizable computational cost. In particular, we compare the two approaches with respect to several performance measures when applied to the Lasso on both simulated and real data sets. For dependent predictors common in practice, our main finding is that, ESCV cuts down…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsStatistical Methods and Inference · Single-cell and spatial transcriptomics · Birth, Development, and Health
