Likelihood Inflating Sampling Algorithm
Reihaneh Entezari, Radu V. Craiu, and Jeffrey S. Rosenthal

TL;DR
LISA is a parallel MCMC method that reduces computational costs for large datasets by splitting data, running independent chains on subsets, and combining results to approximate the full posterior efficiently.
Contribution
Introduces LISA, a communication-free parallel algorithm that inflates likelihoods and combines sub-posterior samples, enabling scalable Bayesian inference for large data sets.
Findings
LISA significantly reduces computation time compared to traditional MCMC.
The method effectively approximates the full posterior in Bayesian Additive Regression Trees.
Performance validated on both simulated and real datasets.
Abstract
Markov Chain Monte Carlo (MCMC) sampling from a posterior distribution corresponding to a massive data set can be computationally prohibitive since producing one sample requires a number of operations that is linear in the data size. In this paper, we introduce a new communication-free parallel method, the Likelihood Inflating Sampling Algorithm (LISA), that significantly reduces computational costs by randomly splitting the dataset into smaller subsets and running MCMC methods independently in parallel on each subset using different processors. Each processor will be used to run an MCMC chain that samples sub-posterior distributions which are defined using an "inflated" likelihood function. We develop a strategy for combining the draws from different sub-posteriors to study the full posterior of the Bayesian Additive Regression Trees (BART) model. The performance of the method is…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
