Thresholded Lasso for high dimensional variable selection and statistical estimation

Shuheng Zhou

arXiv:1002.1583 · math.ST · February 11, 2010 · 43 cites

Open Access

TL;DR

This paper introduces the Thresholded Lasso, a multi-step thresholding method that accurately estimates sparse high-dimensional vectors in linear models, achieving near-oracle performance under certain conditions.

Contribution

The paper proposes the Thresholded Lasso method and demonstrates its ability to achieve sparse oracle inequalities in high-dimensional settings, under restricted eigenvalue conditions.

Findings

01

Achieves $ℓ_2$ loss within a logarithmic factor of the oracle's mean square error.

02

Recovers the model selection accuracy of $ℓ_0$ penalized estimators.

03

Simulation results confirm the theoretical guarantees.

Abstract

Given $n$ noisy samples with $p$ dimensions, where $n ≪ p$, we show that the multi-step thresholding procedure based on the Lasso, which we call the Thresholded Lasso, can accurately estimate a sparse vector $β \in \mathbb{R}^{p}$ in a linear model $Y = X β + ϵ$, where $X_{n \times p}$ is a design matrix normalized to have column $ℓ_{2}$ norm $\sqrt{n}$, and $ϵ \sim N(0, σ^{2} I_{n})$. We show that under the restricted eigenvalue (RE) condition (Bickel-Ritov-Tsybakov 09), it is possible to achieve the $ℓ_{2}$ loss within a logarithmic factor of the ideal mean square error one would achieve with an oracle while selecting a sufficiently sparse model, hence achieving sparse oracle inequalities; the oracle would supply perfect information about which coordinates are non-zero and which are above the noise level. In some sense, the Thresholded Lasso…
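
The procedure described in the abstract can be illustrated with a minimal sketch: fit an initial Lasso, keep the coordinates whose estimates exceed a threshold, and refit by ordinary least squares on that support. This is an illustration under stated assumptions, not the paper's exact algorithm. The abstract does not give tuning constants, so the penalty `lam` and the threshold `tau` below are illustrative choices. The function names (`lasso_cd`, `thresholded_lasso`) are hypothetical, the coordinate descent solver is a simple stand-in for any Lasso solver, and only a single threshold-and-refit pass is shown where the paper describes a multi-step procedure.

```python
import numpy as np


def soft_threshold(z, t):
    """Soft-thresholding operator: shrink z toward zero by t."""
    return np.sign(z) * np.maximum(np.abs(z) - t, 0.0)


def lasso_cd(X, y, lam, n_sweeps=200):
    """Minimise (1/(2n))||y - Xb||^2 + lam*||b||_1 by cyclic coordinate descent.

    Assumes every column of X has l2 norm sqrt(n), as in the abstract.
    """
    n, p = X.shape
    b = np.zeros(p)
    r = y.copy()  # running residual y - X b
    for _ in range(n_sweeps):
        for j in range(p):
            old = b[j]
            # Partial correlation with coordinate j added back in.
            rho = X[:, j] @ r / n + old
            b[j] = soft_threshold(rho, lam)
            if b[j] != old:
                r -= X[:, j] * (b[j] - old)
    return b


def thresholded_lasso(X, y, lam, tau):
    """Initial Lasso, hard threshold at tau, then OLS refit on the support."""
    b_init = lasso_cd(X, y, lam)
    support = np.flatnonzero(np.abs(b_init) >= tau)
    b = np.zeros(X.shape[1])
    if support.size:
        coef = np.linalg.lstsq(X[:, support], y, rcond=None)[0]
        b[support] = coef
    return b, support


# Synthetic example with n < p (all values below are illustrative assumptions).
rng = np.random.default_rng(0)
n, p, sigma = 100, 200, 0.5
X = rng.standard_normal((n, p))
X *= np.sqrt(n) / np.linalg.norm(X, axis=0)  # column l2 norm sqrt(n)
beta = np.zeros(p)
beta[:3] = [3.0, -3.0, 3.0]
y = X @ beta + sigma * rng.standard_normal(n)

lam = 2 * sigma * np.sqrt(2 * np.log(p) / n)  # assumed penalty level
beta_hat, support = thresholded_lasso(X, y, lam, tau=lam)
print("selected support:", support)
print("max abs error:", np.max(np.abs(beta_hat - beta)))
```

The refit step matters because the Lasso shrinks large coefficients toward zero by roughly `lam`. Least squares restricted to the selected columns removes that bias, provided the thresholding step has kept the right coordinates.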

Taxonomy

Topics: Statistical Methods and Inference · Sparse and Compressive Sensing Techniques · Markov Chains and Monte Carlo Methods