Empirical Bayes Variable Selection with Lasso Statistics in the AMP Framework
Lina Hidmi, Asaf Weinstein

TL;DR
This paper develops an empirical Bayes variable selection method within the AMP framework, optimizing the tradeoff between false discoveries and power in high-dimensional linear regression.
Contribution
It introduces an optimal variable selection procedure based on local false discovery rates, extending AMP theory and demonstrating substantial power gains.
Findings
The proposed method accurately predicts the false discovery-power tradeoff.
Optimal regularization minimizes asymptotic mean squared error.
Numerical studies confirm significant power improvements.
Abstract
The Lasso is one of the most ubiquitous methods for variable selection in high-dimensional linear regression and has been studied extensively under different regimes. In a particular asymptotic setup entailing , an i.i.d.~Gaussian matrix and linear sparsity, \citet{su2017false} analyzed the Lasso selection path and presented negative results, showing that maintaining small levels of the false discovery proportion comes at a substantial cost in power. Followup work by \citet{wang2020bridge} used the same framework to study the tradeoff between type I error and power for thresholded-Lasso selection, which ranks the variables based on the magnitude of the Lasso estimate instead of the order of appearance on the Lasso path, and demonstrated that significant improvements are possible if the regularization parameter is chosen appropriately. We take this line of…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsStatistical Methods and Inference · Advanced Bandit Algorithms Research · Advanced Causal Inference Techniques
