Robust Distributed Bayesian Learning with Stragglers via Consensus Monte Carlo
Hari Hara Suthan Chittoor, Osvaldo Simeone

TL;DR
This paper introduces two novel straggler-resilient methods for distributed Bayesian learning using consensus Monte Carlo, enhancing robustness and efficiency in the presence of slow or failing workers.
Contribution
It proposes G-CMC and C-CMC, two new algorithms that incorporate grouping and coding to mitigate stragglers in distributed Bayesian inference.
Findings
C-CMC outperforms G-CMC with fewer workers.
G-CMC is more effective with a larger number of workers.
Both methods improve robustness against stragglers in distributed Bayesian learning.
Abstract
This paper studies distributed Bayesian learning in a setting encompassing a central server and multiple workers by focusing on the problem of mitigating the impact of stragglers. The standard one-shot, or embarrassingly parallel, Bayesian learning protocol known as consensus Monte Carlo (CMC) is generalized by proposing two straggler-resilient solutions based on grouping and coding. Two main challenges in designing straggler-resilient algorithms for CMC are the need to estimate the statistics of the workers' outputs across multiple shots, and the joint non-linear post-processing of the outputs of the workers carried out at the server. This is in stark contrast to other distributed settings like gradient coding, which only require the per-shot sum of the workers' outputs. The proposed methods, referred to as Group-based CMC (G-CMC) and Coded CMC (C-CMC), leverage redundant computing at…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTarget Tracking and Data Fusion in Sensor Networks · Domain Adaptation and Few-Shot Learning · Adversarial Robustness in Machine Learning
