Thresholded Lasso for high dimensional variable selection and statistical estimation
Shuheng Zhou

TL;DR
This paper introduces the Thresholded Lasso, a multi-step thresholding method that accurately estimates sparse high-dimensional vectors in linear models, achieving near-oracle performance under certain conditions.
Contribution
The paper proposes the Thresholded Lasso method and demonstrates its ability to achieve sparse oracle inequalities in high-dimensional settings, under restricted eigenvalue conditions.
Findings
Achieves $ ext{l}_2$ loss within a logarithmic factor of the oracle's mean square error.
Recovers the model selection accuracy of $ ext{l}_0$ penalized estimators.
Simulation results confirm the theoretical guarantees.
Abstract
Given noisy samples with dimensions, where , we show that the multi-step thresholding procedure based on the Lasso -- we call it the {\it Thresholded Lasso}, can accurately estimate a sparse vector in a linear model , where is a design matrix normalized to have column norm , and . We show that under the restricted eigenvalue (RE) condition (Bickel-Ritov-Tsybakov 09), it is possible to achieve the loss within a logarithmic factor of the ideal mean square error one would achieve with an {\em oracle} while selecting a sufficiently sparse model -- hence achieving {\it sparse oracle inequalities}; the oracle would supply perfect information about which coordinates are non-zero and which are above the noise level. In some sense, the Thresholded Lasso…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsStatistical Methods and Inference · Sparse and Compressive Sensing Techniques · Markov Chains and Monte Carlo Methods
