Conditional Sure Independence Screening
Emre Barut, Jianqing Fan, Anneleen Verhasselt

TL;DR
This paper introduces Conditional Sure Independence Screening (CSIS) for variable selection in ultrahigh-dimensional generalized linear models, improving accuracy by incorporating prior knowledge and reducing false positives and negatives.
Contribution
The paper develops CSIS, a novel method that leverages prior information to enhance variable screening accuracy in high-dimensional settings, with theoretical guarantees and practical algorithms.
Findings
CSIS reduces false positive and false negative rates.
Theoretical conditions for sure screening and model selection consistency.
Demonstrated effectiveness through simulations and real data analysis.
Abstract
Independence screening is a powerful method for variable selection for `Big Data' when the number of variables is massive. Commonly used independence screening methods are based on marginal correlations or variations of it. In many applications, researchers often have some prior knowledge that a certain set of variables is related to the response. In such a situation, a natural assessment on the relative importance of the other predictors is the conditional contributions of the individual predictors in presence of the known set of variables. This results in conditional sure independence screening (CSIS). Conditioning helps for reducing the false positive and the false negative rates in the variable selection process. In this paper, we propose and study CSIS in the context of generalized linear models. For ultrahigh-dimensional statistical problems, we give conditions under which sure…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsStatistical Methods and Inference · Bayesian Modeling and Causal Inference · Advanced Statistical Methods and Models
