Estimation Stability with Cross Validation (ESCV)

Chinghway Lim; Bin Yu

arXiv:1303.3128·stat.ME·October 28, 2015·5 cites

Estimation Stability with Cross Validation (ESCV)

Chinghway Lim, Bin Yu

PDF

Open Access

TL;DR

This paper introduces ESCV, a new model selection criterion based on estimation stability, which improves model interpretability and reduces false positives compared to traditional cross-validation in high-dimensional sparse modeling.

Contribution

ESCV provides a model-free, stability-based criterion for regularization parameter selection that outperforms CV in false positive control and model plausibility.

Findings

01

ESCV reduces false positive rates significantly.

02

ESCV models are smaller and more plausible.

03

ESCV maintains similar prediction performance as CV.

Abstract

Cross-validation (CV) is often used to select the regularization parameter in high dimensional problems. However, when applied to the sparse modeling method Lasso, CV leads to models that are unstable in high-dimensions, and consequently not suited for reliable interpretation. In this paper, we propose a model-free criterion ESCV based on a new estimation stability (ES) metric and CV. Our proposed ESCV finds a locally ES-optimal model smaller than the CV choice so that the it fits the data and also enjoys estimation stability property. We demonstrate that ESCV is an effective alternative to CV at a similar easily parallelizable computational cost. In particular, we compare the two approaches with respect to several performance measures when applied to the Lasso on both simulated and real data sets. For dependent predictors common in practice, our main finding is that, ESCV cuts down…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsStatistical Methods and Inference · Single-cell and spatial transcriptomics · Birth, Development, and Health