Massive parallelization of serial inference algorithms for a complex generalized linear model
Marc A. Suchard, Shawn E. Simpson, Ivan Zorych, Patrick Ryan, David, Madigan

TL;DR
This paper demonstrates how leveraging GPU-based parallel computing dramatically accelerates the fitting of complex generalized linear models on large-scale health data, enabling more efficient drug safety analysis.
Contribution
It introduces a massively parallelized coordinate descent algorithm for fitting complex generalized linear models on large datasets, significantly improving computational efficiency.
Findings
Orders-of-magnitude speedup in model fitting time
Successful application to datasets with tens of millions of observations
Potential to enhance drug safety surveillance methods
Abstract
Following a series of high-profile drug safety disasters in recent years, many countries are redoubling their efforts to ensure the safety of licensed medical products. Large-scale observational databases such as claims databases or electronic health record systems are attracting particular attention in this regard, but present significant methodological and computational concerns. In this paper we show how high-performance statistical computation, including graphics processing units, relatively inexpensive highly parallel computing devices, can enable complex methods in large databases. We focus on optimization and massive parallelization of cyclic coordinate descent approaches to fit a conditioned generalized linear model involving tens of millions of observations and thousands of predictors in a Bayesian context. We find orders-of-magnitude improvement in overall run-time. Coordinate…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsStatistical Methods and Bayesian Inference · Statistical Methods and Inference · Bayesian Methods and Mixture Models
