Parallelizing MCMC with Random Partition Trees
Xiangyu Wang, Fangjian Guo, Katherine A. Heller, David B. Dunson

TL;DR
This paper introduces PART, a novel embarrassingly parallel MCMC method using random partition trees to improve accuracy, resampling, and scalability for large datasets in Bayesian inference.
Contribution
The paper proposes a new EP-MCMC algorithm that leverages random partition trees, addressing limitations of existing methods in approximation accuracy and resampling.
Findings
The PART algorithm is distribution-free and adaptable to multiple scales.
Extensive experiments show improved empirical performance over existing methods.
Theoretical analysis supports the effectiveness of the proposed approach.
Abstract
The modern scale of data has brought new challenges to Bayesian inference. In particular, conventional MCMC algorithms are computationally very expensive for large data sets. A promising approach to solve this problem is embarrassingly parallel MCMC (EP-MCMC), which first partitions the data into multiple subsets and runs independent sampling algorithms on each subset. The subset posterior draws are then aggregated via some combining rules to obtain the final approximation. Existing EP-MCMC algorithms are limited by approximation accuracy and difficulty in resampling. In this article, we propose a new EP-MCMC algorithm PART that solves these problems. The new algorithm applies random partition trees to combine the subset posterior draws, which is distribution-free, easy to resample from and can adapt to multiple scales. We provide theoretical justification and extensive experiments…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsBayesian Methods and Mixture Models · Markov Chains and Monte Carlo Methods · Gaussian Processes and Bayesian Inference
