A significance test for the lasso
Richard Lockhart, Jonathan Taylor, Ryan J. Tibshirani, Robert, Tibshirani

TL;DR
This paper introduces a significance test for variables entering the lasso model, using a covariance-based statistic that asymptotically follows an exponential distribution under the null hypothesis, even in high-dimensional settings.
Contribution
It proposes a novel significance test for lasso variables that accounts for adaptivity and shrinkage, applicable in high-dimensional linear regression.
Findings
The covariance test statistic follows an Exp(1) distribution under the null hypothesis.
The method is valid even when the true model is high-dimensional and not perfectly recovered.
It explicitly accounts for the adaptivity of the lasso selection process.
Abstract
In the sparse linear regression setting, we consider testing the significance of the predictor variable that enters the current lasso model, in the sequence of models visited along the lasso solution path. We propose a simple test statistic based on lasso fitted values, called the covariance test statistic, and show that when the true model is linear, this statistic has an asymptotic distribution under the null hypothesis (the null being that all truly active variables are contained in the current lasso model). Our proof of this result for the special case of the first predictor to enter the model (i.e., testing for a single significant predictor variable against the global null) requires only weak assumptions on the predictor matrix . On the other hand, our proof for a general step in the lasso path places further technical assumptions on and the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
