Ultrahigh dimensional variable selection: beyond the linear model
Jianqing Fan, Richard Samworth, Yichao Wu

TL;DR
This paper extends iterative sure independence screening (ISIS) to a broader pseudo-likelihood framework, improving variable selection in high-dimensional data, especially for classification tasks where traditional methods fail, and introduces techniques to reduce false discoveries.
Contribution
It generalizes ISIS beyond linear models to a pseudo-likelihood framework, enabling better feature selection and false discovery control in high-dimensional settings.
Findings
Improved variable selection in high-dimensional classification.
Effective reduction of false discovery rate during screening.
Successful application to simulated and real datasets.
Abstract
Variable selection in high-dimensional space characterizes many contemporary problems in scientific discovery and decision making. Many frequently-used techniques are based on independence screening; examples include correlation ranking (Fan and Lv, 2008) or feature selection using a two-sample t-test in high-dimensional classification (Tibshirani et al., 2003). Within the context of the linear model, Fan and Lv (2008)showed that this simple correlation ranking possesses a sure independence screening property under certain conditions and that its revision, called iteratively sure independent screening (ISIS), is needed when the features are marginally unrelated but jointly related to the response variable. In this paper, we extend ISIS, without explicit definition of residuals, to a general pseudo-likelihood framework, which includes generalized linear models as a special case. Even in…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsStatistical Methods and Inference · Gene expression and cancer classification · Optimal Experimental Design Methods
