Variable selection in linear regression models: choosing the best subset is not always the best choice
Moritz Hanke, Louis Dijkstra, Ronja Foraita, Vanessa Didelez

TL;DR
This study compares various variable selection methods in linear regression, revealing that best subset selection is not always superior, especially with correlated variables or lower signal-to-noise ratios.
Contribution
The paper provides a comprehensive neutral comparison of BSS, FSS, Lasso, and Enet, challenging the assumption that BSS is always the best choice.
Findings
BSS outperforms others mainly in high SNR and uncorrelated variables.
FSS performs nearly as well as BSS across many settings.
Enet and other alternatives are faster and often better with correlated predictors.
Abstract
Variable selection in linear regression settings is a much discussed problem. Best subset selection (BSS) is often considered the intuitive 'gold standard', with its use being restricted only by its NP-hard nature. Alternatives such as the least absolute shrinkage and selection operator (Lasso) or the elastic net (Enet) have become methods of choice in high-dimensional settings. A recent proposal represents BSS as a mixed integer optimization problem so that much larger problems have become feasible in reasonable computation time. We present an extensive neutral comparison assessing the variable selection performance, in linear regressions, of BSS compared to forward stepwise selection (FSS), Lasso and Enet. The simulation study considers a wide range of settings that are challenging with regard to dimensionality (with respect to the number of observations and variables),…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMetaheuristic Optimization Algorithms Research · Evolutionary Algorithms and Applications · Neural Networks and Applications
