Lasso-type recovery of sparse representations for high-dimensional data
Nicolai Meinshausen, Bin Yu

TL;DR
This paper investigates the behavior of Lasso estimators in high-dimensional settings when the irrepresentable condition is violated, showing they remain consistent in norm and can effectively identify important variables.
Contribution
It relaxes the irrepresentable condition for Lasso, demonstrating norm consistency and variable selection effectiveness under weaker assumptions.
Findings
Lasso remains consistent in the -norm without irrepresentable condition.
The estimator can identify all important variables with high probability.
The convergence rate is optimal under bounded sparse eigenvalues.
Abstract
The Lasso is an attractive technique for regularization and variable selection for high-dimensional data, where the number of predictor variables is potentially much larger than the number of samples . However, it was recently discovered that the sparsity pattern of the Lasso estimator can only be asymptotically identical to the true sparsity pattern if the design matrix satisfies the so-called irrepresentable condition. The latter condition can easily be violated in the presence of highly correlated variables. Here we examine the behavior of the Lasso estimators if the irrepresentable condition is relaxed. Even though the Lasso cannot recover the correct sparsity pattern, we show that the estimator is still consistent in the -norm sense for fixed designs under conditions on (a) the number of nonzero components of the vector and (b) the minimal singular…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
