Parallel Approaches to Accelerate Bayesian Decision Trees
Efthyvoulos Drousiotis, Paul G. Spirakis, and Simon Maskell

TL;DR
This paper explores parallel computing techniques to accelerate Bayesian decision trees, comparing data partitioning and Sequential Monte Carlo methods, with SMC showing significant runtime improvements in large data scenarios.
Contribution
It introduces two parallelization strategies for Bayesian decision trees, replacing MCMC with SMC and using data partitioning, demonstrating the effectiveness of SMC in HPC environments.
Findings
SMC can improve runtime by up to 343 times
Data partitioning has limited benefits in tested scenarios
SMC is inherently more parallelizable than MCMC
Abstract
Markov Chain Monte Carlo (MCMC) is a well-established family of algorithms primarily used in Bayesian statistics to sample from a target distribution when direct sampling is challenging. Existing work on Bayesian decision trees uses MCMC. Unfortunately, this can be slow, especially when considering large volumes of data. It is hard to parallelise the accept-reject component of the MCMC. None-the-less, we propose two methods for exploiting parallelism in the MCMC: in the first, we replace the MCMC with another numerical Bayesian approach, the Sequential Monte Carlo (SMC) sampler, which has the appealing property that it is an inherently parallel algorithm; in the second, we consider data partitioning. Both methods use multi-core processing with a HighPerformance Computing (HPC) resource. We test the two methods in various study settings to determine which method is the most beneficial…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsBayesian Modeling and Causal Inference · Statistical Methods and Inference · Data Quality and Management
MethodsTest
