An Algorithm for Distributed Bayesian Inference in Generalized Linear Models
Nariankadu D. Shyamalkumar, Sanvesh Srivastava

TL;DR
This paper introduces a scalable distributed Bayesian inference algorithm for generalized linear models that efficiently handles massive datasets by dividing data, parallelizing computations, and combining results, maintaining statistical accuracy.
Contribution
It develops a novel divide-and-conquer Bayesian inference method using powered likelihoods, with theoretical guarantees and practical effectiveness demonstrated on linear and logistic regressions.
Findings
Comparable accuracy to state-of-the-art algorithms
Significant computational efficiency gains
Asymptotic optimality under regularity conditions
Abstract
Monte Carlo algorithms, such as Markov chain Monte Carlo (MCMC) and Hamiltonian Monte Carlo (HMC), are routinely used for Bayesian inference in generalized linear models; however, these algorithms are prohibitively slow in massive data settings because they require multiple passes through the full data in every iteration. Addressing this problem, we develop a scalable extension of these algorithms using the divide-and-conquer (D&C) technique that divides the data into a sufficiently large number of subsets, draws parameters in parallel on the subsets using a \textit{powered} likelihood, and produces Monte Carlo draws of the parameter by combining parameter draws obtained from each subset. These combined parameter draws play the role of draws from the original sampling algorithm. Our main contributions are two-fold. First, we demonstrate through diverse simulated and real data analyses…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMarkov Chains and Monte Carlo Methods · Statistical Methods and Inference · Gaussian Processes and Bayesian Inference
