Post-Lasso Inference for High-Dimensional Regression
X. Jessie Jeng, Huimin Peng, Wenbin Lu

TL;DR
This paper introduces a novel variable selection method for high-dimensional regression that improves accuracy by controlling noise variable inclusion using post-selection inference techniques.
Contribution
It proposes a new approach to variable selection that identifies the positions of relevant and noise variables on the Lasso path, enhancing selection precision.
Findings
Method outperforms existing techniques in selection accuracy.
Provides better interpretability of selected variables.
Utilizes covariance and Q statistics for post-selection inference.
Abstract
Among the most popular variable selection procedures in high-dimensional regression, Lasso provides a solution path to rank the variables and determines a cut-off position on the path to select variables and estimate coefficients. In this paper, we consider variable selection from a new perspective motivated by the frequently occurred phenomenon that relevant variables are not completely distinguishable from noise variables on the solution path. We propose to characterize the positions of the first noise variable and the last relevant variable on the path. We then develop a new variable selection procedure to control over-selection of the noise variables ranking after the last relevant variable, and, at the same time, retain a high proportion of relevant variables ranking before the first noise variable. Our procedure utilizes the recently developed covariance test statistic and Q…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsStatistical Methods and Inference · Bayesian Methods and Mixture Models · Statistical Methods and Bayesian Inference
