Model selection with lasso-zero: adding straw to the haystack to better   find needles

Pascaline Descloux; Sylvain Sardy

arXiv:1805.05133·stat.ME·April 15, 2019·J. Comput. Graph. Stat.

Model selection with lasso-zero: adding straw to the haystack to better find needles

Pascaline Descloux, Sylvain Sardy

PDF

Open Access 1 Repo

TL;DR

Lasso-Zero is a novel high-dimensional support recovery method that overfits with noise dictionaries and thresholding, achieving strong theoretical guarantees and competitive empirical performance.

Contribution

It introduces a new overfit-then-threshold approach with noise dictionaries and a universal threshold, improving support recovery in high-dimensional linear models.

Findings

01

Lasso-Zero outperforms competitors in support recovery.

02

It achieves sign consistency under weaker conditions than Lasso.

03

Noise dictionaries enhance performance for low signals.

Abstract

The high-dimensional linear model $y = X β^{0} + ϵ$ is considered and the focus is put on the problem of recovering the support $S^{0}$ of the sparse vector $β^{0} .$ We introduce Lasso-Zero, a new $ℓ_{1}$ -based estimator whose novelty resides in an "overfit, then threshold" paradigm and the use of noise dictionaries concatenated to $X$ for overfitting the response. To select the threshold, we employ the quantile universal threshold based on a pivotal statistic that requires neither knowledge nor preliminary estimation of the noise level. Numerical simulations show that Lasso-Zero performs well in terms of support recovery and provides an excellent trade-off between high true positive rate and low false discovery rate compared to competitors. Our methodology is supported by theoretical results showing that when no noise dictionary is used, Lasso-Zero recovers the signs of…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

pascalinedescloux/lasso-zero
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsStatistical Methods and Inference · Machine Learning and Algorithms · Gaussian Processes and Bayesian Inference