A Selective Overview of Variable Selection in High Dimensional Feature Space (Invited Review Article)
Jianqing Fan, Jinchi Lv

TL;DR
This review summarizes recent theoretical and methodological advances in high-dimensional variable selection, focusing on penalized likelihood methods, their properties, and applications to ultra-high dimensional data.
Contribution
It provides a comprehensive overview of developments in penalized likelihood approaches, including non-concave penalties and ultra-high dimensional techniques like screening methods.
Findings
Non-concave penalized likelihood methods have desirable statistical properties.
Screening and two-scale methods are effective for ultra-high dimensional variable selection.
Recent theory extends the applicability of variable selection techniques to higher dimensions.
Abstract
High dimensional statistical problems arise from diverse fields of scientific research and technological development. Variable selection plays a pivotal role in contemporary statistical learning and scientific discoveries. The traditional idea of best subset selection methods, which can be regarded as a specific form of penalized likelihood, is computationally too expensive for many modern statistical applications. Other forms of penalized likelihood methods have been successfully developed over the last decade to cope with high dimensionality. They have been widely applied for simultaneously selecting important variables and estimating their effects in high dimensional statistical inference. In this article, we present a brief account of the recent developments of theory, methods, and implementations for high dimensional variable selection. What limits of the dimensionality such…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsStatistical Methods and Inference · Bayesian Methods and Mixture Models · Sparse and Compressive Sensing Techniques
