Sampling Parallelism for Fast and Efficient Bayesian Learning
Asena Karolin \"Ozdemir, Lars H. Heyen, Arvid Weyrauch, Achim Streit, Markus G\"otz, Charlotte Debus

TL;DR
This paper introduces sampling parallelism, a GPU-based parallelization method that accelerates Bayesian neural network training by distributing sample evaluations, reducing memory use, and enhancing uncertainty quantification.
Contribution
The authors propose a simple parallelization strategy for sampling-based Bayesian learning that improves scalability and reduces training time without architectural changes.
Findings
Sampling parallelism achieves near-perfect scaling with increased GPUs.
It reduces memory pressure and training time for Bayesian neural networks.
Combining sample and data parallelism enhances model diversity and convergence.
Abstract
Machine learning models, and deep neural networks in particular, are increasingly deployed in risk-sensitive domains such as healthcare, environmental forecasting, and finance, where reliable quantification of predictive uncertainty is essential. However, many uncertainty quantification (UQ) methods remain difficult to apply due to their substantial computational cost. Sampling-based Bayesian learning approaches, such as Bayesian neural networks (BNNs), are particularly expensive since drawing and evaluating multiple parameter samples rapidly exhausts memory and compute resources. These constraints have limited the accessibility and exploration of Bayesian techniques thus far. To address these challenges, we introduce sampling parallelism, a simple yet powerful parallelization strategy that targets the primary bottleneck of sampling-based Bayesian learning: the samples themselves. By…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
