Robust Variable and Interaction Selection for Logistic Regression and Multiple Index Models
Yang Li, Jun S. Liu

TL;DR
This paper introduces SODA, a robust stepwise method for variable and interaction selection in logistic regression and multiple index models, effective in high-dimensional, non-Gaussian data.
Contribution
SODA is a novel, robust variable selection method that handles high-dimensional data without normality assumptions, extending to multiple index models.
Findings
SODA outperforms existing methods in high-dimensional QDA variable selection.
SODA is robust to non-Gaussian predictor distributions.
Theoretical guarantees confirm SODA's variable-selection consistency.
Abstract
We propose Stepwise cOnditional likelihood variable selection for Discriminant Analysis (SODA) to detect both main and quadratic interaction effects in logistic regression and quadratic discriminant analysis (QDA) models. In the forward stage, SODA adds in important predictors evaluated based on their overall contributions, whereas in the backward stage SODA removes unimportant terms so as to optimize the extended Bayesian Information Criterion (EBIC). Compared with existing methods on QDA variable selections, SODA can deal with high-dimensional data with the number of predictors much larger than the sample size and does not require the joint normality assumption on predictors, leading to much enhanced robustness. We further extend SODA to conduct variable selection and model fitting for multiple index models. Compared with existing variable selection methods based on the Sliced Inverse…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsFace and Expression Recognition · Statistical Methods and Inference · Fuzzy Systems and Optimization
