Robust Distributed Bayesian Learning with Stragglers via Consensus Monte Carlo

Hari Hara Suthan Chittoor; Osvaldo Simeone

arXiv:2112.09794 · cs.LG · August 30, 2022

TL;DR

This paper introduces two novel straggler-resilient methods for distributed Bayesian learning using consensus Monte Carlo, enhancing robustness and efficiency in the presence of slow or failing workers.

Contribution

The paper proposes Group-based CMC (G-CMC) and Coded CMC (C-CMC), two algorithms that use grouping and coding, respectively, to mitigate stragglers in distributed Bayesian inference.
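To give a feel for why grouping workers adds straggler resilience, the following toy calculation may help. It is not the paper's G-CMC definition: it assumes, purely for illustration, that workers are split into equal-size groups, that every worker in a group redundantly processes the same data shard, and that each worker straggles independently with the same probability. The function name and parameters are hypothetical.

```python
def all_groups_respond_prob(num_workers, group_size, straggle_prob):
    """Probability that every group has at least one non-straggling worker.

    Illustrative assumptions (not taken from the paper): equal-size groups,
    full replication of one shard inside each group, and independent
    straggling with probability `straggle_prob` per worker.
    """
    if num_workers % group_size != 0:
        raise ValueError("num_workers must be a multiple of group_size")
    num_groups = num_workers // group_size
    # A group fails only if all of its workers straggle.
    group_ok = 1.0 - straggle_prob ** group_size
    # Groups are independent, so the probabilities multiply.
    return group_ok ** num_groups


if __name__ == "__main__":
    for size in (1, 2, 4):
        print(size, all_groups_respond_prob(8, size, 0.3))
```

Under these assumptions, larger groups make a complete response more likely, at the cost of splitting the data into fewer distinct shards.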

Findings

01 C-CMC can outperform G-CMC when the number of workers is small.

02 G-CMC is generally preferable when the number of workers is large.

03 Both methods improve robustness to stragglers in distributed Bayesian learning.

Abstract

This paper studies distributed Bayesian learning in a setting encompassing a central server and multiple workers by focusing on the problem of mitigating the impact of stragglers. The standard one-shot, or embarrassingly parallel, Bayesian learning protocol known as consensus Monte Carlo (CMC) is generalized by proposing two straggler-resilient solutions based on grouping and coding. Two main challenges in designing straggler-resilient algorithms for CMC are the need to estimate the statistics of the workers' outputs across multiple shots, and the joint non-linear post-processing of the outputs of the workers carried out at the server. This is in stark contrast to other distributed settings like gradient coding, which only require the per-shot sum of the workers' outputs. The proposed methods, referred to as Group-based CMC (G-CMC) and Coded CMC (C-CMC), leverage redundant computing at…
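The server-side post-processing mentioned above can be illustrated with the standard, straggler-free CMC combination rule, in which the server averages the workers' draws using the inverse sample covariance of each worker's draws as a weight. The sketch below is a minimal version of that baseline under the assumption that every worker returns the same number of draws; it does not implement G-CMC or C-CMC, and the function name `cmc_combine` is hypothetical.

```python
import numpy as np


def cmc_combine(worker_samples):
    """Precision-weighted consensus of per-worker sub-posterior draws.

    worker_samples: array-like of shape (K, S, d), i.e. K workers,
    S draws per worker, d parameters. Returns an (S, d) array.
    """
    worker_samples = np.asarray(worker_samples, dtype=float)
    num_workers = worker_samples.shape[0]
    # One weight matrix per worker: the inverse of the sample covariance
    # of that worker's draws, which must be estimated from multiple draws.
    weights = np.stack([
        np.linalg.inv(np.atleast_2d(np.cov(worker_samples[k], rowvar=False)))
        for k in range(num_workers)
    ])  # shape (K, d, d)
    total_weight = weights.sum(axis=0)  # shape (d, d)
    # For each draw index s, compute sum_k W_k @ theta[k, s].
    weighted_sum = np.einsum("kij,ksj->si", weights, worker_samples)
    # Normalise by the summed weights: solve total_weight @ x = weighted_sum.
    return np.linalg.solve(total_weight, weighted_sum.T).T


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    draws = rng.normal(size=(4, 200, 2))  # synthetic draws, for illustration
    print(cmc_combine(draws).shape)
```

Because the weights depend on covariances estimated from many draws and enter through a matrix inverse, the combination is a joint non-linear function of all workers' outputs, unlike the plain per-shot sum that suffices in gradient coding.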

Code & Models

Repositories

kclip/straggler-resilient-cmc (Official)

Taxonomy

Topics: Target Tracking and Data Fusion in Sensor Networks · Domain Adaptation and Few-Shot Learning · Adversarial Robustness in Machine Learning