Variable Selection in GLM and Cox Models with Second-Generation P-Values
Yi Zuo, Thomas G. Stewart, Jeffrey D. Blume

TL;DR
This paper extends the ProSGPV method for variable selection to generalized linear and Cox models, demonstrating its efficiency, adaptability, and strong performance in support recovery, parameter estimation, and prediction across various settings.
Contribution
It introduces a largely tuning-parameter free extension of ProSGPV to GLMs and Cox models, enhancing variable selection methods with strong theoretical and empirical support.
Findings
ProSGPV performs well in support recovery and parameter estimation.
The method maintains strong prediction accuracy.
It is computationally fast and adaptable to different regularization schemes.
Abstract
Variable selection has become a pivotal choice in data analyses that impacts subsequent inference and prediction. In linear models, variable selection using Second-Generation P-Values (SGPV) has been shown to be as good as any other algorithm available to researchers. Here we extend the idea of Penalized Regression with Second-Generation P-Values (ProSGPV) to the generalized linear model (GLM) and Cox regression settings. The proposed ProSGPV extension is largely free of tuning parameters, adaptable to various regularization schemes and null bound specifications, and is computationally fast. Like in the linear case, it excels in support recovery and parameter estimation while maintaining strong prediction performance. The algorithm also preforms as well as its competitors in the high dimensional setting (n>p). Slight modifications of the algorithm improve its performance when data are…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSparse and Compressive Sensing Techniques · Statistical Methods and Inference · Control Systems and Identification
