Bounded-memory adjusted scores estimation in generalized linear models with large data sets
Patrick Zietkiewicz, Ioannis Kosmidis

TL;DR
This paper introduces two memory-efficient iterative algorithms for estimating generalized linear models with bias-reducing adjusted scores, suitable for large-scale and distributed data sets, using incremental QR decompositions.
Contribution
It develops novel IWLS variants that operate with limited memory and can handle distributed data, advancing scalable bias-reduction in GLMs.
Findings
Algorithms successfully applied to datasets with millions of observations.
Memory requirements are significantly reduced compared to traditional methods.
Methods are adaptable for distributed and privacy-sensitive data environments.
Abstract
The widespread use of maximum Jeffreys'-prior penalized likelihood in binomial-response generalized linear models, and in logistic regression, in particular, are supported by the results of Kosmidis and Firth (2021, Biometrika), who show that the resulting estimates are always finite-valued, even in cases where the maximum likelihood estimates are not, which is a practical issue regardless of the size of the data set. In logistic regression, the implied adjusted score equations are formally bias-reducing in asymptotic frameworks with a fixed number of parameters and appear to deliver a substantial reduction in the persistent bias of the maximum likelihood estimator in high-dimensional settings where the number of parameters grows asymptotically as a proportion of the number of observations. In this work, we develop and present two new variants of iteratively reweighted least squares for…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Statistical Methods and Models · Statistical Methods and Bayesian Inference · Bayesian Modeling and Causal Inference
