Stability Selection

Nicolai Meinshausen; Peter Buehlmann

arXiv:0809.2932 · stat.ME · May 16, 2009 · 35 cites

TL;DR

Stability selection combines subsampling with (high-dimensional) selection algorithms to improve structure estimation and to control some error rates of false discoveries. It applies to problems such as variable selection, graphical modelling and cluster analysis.

Contribution

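The paper introduces stability selection, a general approach that improves variable selection and structure estimation and provides finite sample control for some error rates of false discoveries. It also proves that stability selection with randomized Lasso is variable selection consistent even when the conditions needed for consistency of the original Lasso are violated.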

Findings

01

Improves variable selection and structure estimation accuracy.

02

Provides finite sample control for some error rates of false discoveries.

03

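Proves that stability selection with randomized Lasso is variable selection consistent, even when the conditions needed for consistency of the original Lasso are violated.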
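The finite sample control listed in the findings can be made concrete with a small calculation. The bound used below is recalled from the paper itself rather than from this summary: under an exchangeability assumption, and assuming the base selector is no worse than random guessing, the expected number of falsely selected variables is at most q^2 / ((2 * pi_thr - 1) * p), where q is the average number of variables selected per subsample, p is the total number of variables, and pi_thr is a selection-frequency threshold between 1/2 and 1. The function name and argument names are hypothetical, chosen for illustration.

```python
def expected_false_selections_bound(q, p, pi_thr):
    """Upper bound on the expected number of false selections.

    q      : average number of variables the base selector picks per subsample
    p      : total number of candidate variables
    pi_thr : selection-frequency threshold, assumed to lie in (1/2, 1]

    Assumption: the bound q^2 / ((2 * pi_thr - 1) * p) is recalled from the
    paper and holds only under the conditions stated in the lead-in.
    """
    if not 0.5 < pi_thr <= 1.0:
        raise ValueError("pi_thr must be greater than 0.5 and at most 1")
    if p <= 0 or q < 0:
        raise ValueError("p must be positive and q non-negative")
    return q ** 2 / ((2 * pi_thr - 1) * p)


# Illustrative numbers only: 1000 candidate variables, about 20 selected
# per subsample, and a threshold of 0.75.
bound = expected_false_selections_bound(q=20, p=1000, pi_thr=0.75)
print(bound)
```

Read this way, the bound gives a recipe for choosing the amount of regularisation: fix a tolerable number of false selections and a threshold, then limit how many variables the base selector may pick per subsample.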

Abstract

Estimation of structure, such as in variable selection, graphical modelling or cluster analysis is notoriously difficult, especially for high-dimensional data. We introduce stability selection. It is based on subsampling in combination with (high-dimensional) selection algorithms. As such, the method is extremely general and has a very wide range of applicability. Stability selection provides finite sample control for some error rates of false discoveries and hence a transparent principle to choose a proper amount of regularisation for structure estimation. Variable selection and structure estimation improve markedly for a range of selection methods if stability selection is applied. We prove for randomised Lasso that stability selection will be variable selection consistent even if the necessary conditions needed for consistency of the original Lasso method are violated. We demonstrate…
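The subsampling idea in the abstract can be sketched in a few lines. This is a minimal illustration under stated assumptions, not the authors' implementation: the base selector here is a simple "top q columns by absolute correlation" rule swapped in for the Lasso so that the example needs only the standard library, a single regularisation level (fixed q) is used instead of a path of regularisation values, and the names `top_q_selector`, `stability_selection`, `n_subsamples` and `pi_thr` are hypothetical. Subsamples are drawn without replacement at half the sample size.

```python
import random


def abs_correlation(xs, ys):
    """Absolute Pearson correlation between two equal-length lists."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxy = sum((a - mean_x) * (b - mean_y) for a, b in zip(xs, ys))
    sxx = sum((a - mean_x) ** 2 for a in xs)
    syy = sum((b - mean_y) ** 2 for b in ys)
    if sxx == 0 or syy == 0:
        return 0.0
    return abs(sxy) / (sxx * syy) ** 0.5


def top_q_selector(X, y, q):
    """Hypothetical base selector: the q columns most correlated with y."""
    p = len(X[0])
    scores = [abs_correlation([row[j] for row in X], y) for j in range(p)]
    return sorted(range(p), key=lambda j: scores[j], reverse=True)[:q]


def stability_selection(X, y, q, n_subsamples=100, pi_thr=0.6, seed=0):
    """Return (stable set, selection frequencies) for the base selector."""
    rng = random.Random(seed)
    n, p = len(X), len(X[0])
    counts = [0] * p
    for _ in range(n_subsamples):
        # Draw half of the observations without replacement.
        idx = rng.sample(range(n), n // 2)
        X_sub = [X[i] for i in idx]
        y_sub = [y[i] for i in idx]
        for j in top_q_selector(X_sub, y_sub, q):
            counts[j] += 1
    freqs = [c / n_subsamples for c in counts]
    # Keep variables selected in at least a fraction pi_thr of subsamples.
    stable = [j for j, f in enumerate(freqs) if f >= pi_thr]
    return stable, freqs


# Synthetic data, made up for illustration: only columns 0 and 1 drive y.
data_rng = random.Random(1)
X = [[data_rng.gauss(0, 1) for _ in range(20)] for _ in range(100)]
y = [3 * row[0] - 3 * row[1] + data_rng.gauss(0, 1) for row in X]
stable, freqs = stability_selection(X, y, q=3)
print(stable)
```

The design point is that the final set is defined by selection frequency across subsamples rather than by a single fit, so a variable that enters only occasionally is filtered out by the threshold.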

Taxonomy

Topics: Statistical Methods and Inference · Bayesian Methods and Mixture Models · Statistical Methods and Bayesian Inference