Feature-specific inference for penalized regression using local false discovery rates
Ryan Miller, Patrick Breheny

TL;DR
This paper introduces a method to compute local false discovery rates for variables in penalized regression models, enhancing the reliability of feature selection in high-dimensional data analysis.
Contribution
It extends local false discovery rate methodology to penalized regression, applicable across various models and penalties, improving feature significance assessment.
Findings
Method valid for all λ values in penalized models.
Applicable to GLM, Cox models, MCP, SCAD penalties.
Demonstrated utility in genetic data case studies.
Abstract
Penalized regression methods, most notably the lasso, are a popular approach to analyzing high-dimensional data. An attractive property of the lasso is that it naturally performs variable selection. An important area of concern, however, is the reliability of these variable selections. Motivated by local false discovery rate methodology from the large-scale hypothesis testing literature, we propose a method for calculating a local false discovery rate for each variable under consideration by the lasso model. These rates can be used to assess the reliability of an individual feature, or to estimate the model's overall false discovery rate. The method can be used for all values of . This is particularly useful for models with a few highly significant features but a high overall Fdr, which are a relatively common occurrence when using cross validation to select . It is…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
