Learning sparse generalized linear models with binary outcomes via iterative hard thresholding
Namiko Matsumoto, Arya Mazumdar

TL;DR
This paper introduces BIHT, an iterative hard thresholding algorithm for efficiently estimating sparse generalized linear models with binary outcomes, achieving statistical optimality without requiring knowledge of the link function.
Contribution
The work presents a novel, flexible algorithm for sparse binary GLMs that is both statistically efficient and does not depend on the specific link function, unlike prior methods.
Findings
BIHT converges to the correct sparse solution in general binary GLMs.
In logistic regression, BIHT is statistically optimal with near-minimal sample complexity.
For probit regression, the sample complexity is comparable to logistic regression.
Abstract
In statistics, generalized linear models (GLMs) are widely used for modeling data and can expressively capture potential nonlinear dependence of the model's outcomes on its covariates. Within the broad family of GLMs, those with binary outcomes, which include logistic and probit regressions, are motivated by common tasks such as binary classification with (possibly) non-separable data. In addition, in modern machine learning and statistics, data is often high-dimensional yet has a low intrinsic dimension, making sparsity constraints in models another reasonable consideration. In this work, we propose to use and analyze an iterative hard thresholding (projected gradient descent on the ReLU loss) algorithm, called binary iterative hard thresholding (BIHT), for parameter estimation in sparse GLMs with binary outcomes. We establish that BIHT is statistically efficient and converges to the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsStatistical Methods and Inference · Face and Expression Recognition · Sparse and Compressive Sensing Techniques
Methods*Communicated@Fast*How Do I Communicate to Expedia? · Logistic Regression
