Federated Learning via Posterior Averaging: A New Perspective and Practical Algorithms
Maruan Al-Shedivat, Jennifer Gillenwater, Eric Xing, Afshin, Rostamizadeh

TL;DR
This paper introduces a novel perspective on federated learning as a posterior inference problem, developing an efficient algorithm called FedPA that improves convergence and results on benchmark datasets.
Contribution
It formulates federated learning as a posterior inference task and proposes FedPA, a new algorithm that generalizes FedAvg and enhances convergence and accuracy.
Findings
FedPA converges faster than FedAvg.
FedPA achieves better optima on benchmarks.
FedPA benefits from adaptive optimizers.
Abstract
Federated learning is typically approached as an optimization problem, where the goal is to minimize a global loss function by distributing computation across client devices that possess local data and specify different parts of the global objective. We present an alternative perspective and formulate federated learning as a posterior inference problem, where the goal is to infer a global posterior distribution by having client devices each infer the posterior of their local data. While exact inference is often intractable, this perspective provides a principled way to search for global optima in federated settings. Further, starting with the analysis of federated quadratic objectives, we develop a computation- and communication-efficient approximate posterior inference algorithm -- federated posterior averaging (FedPA). Our algorithm uses MCMC for approximate inference of local…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Taxonomy
TopicsPrivacy-Preserving Technologies in Data · Stochastic Gradient Optimization Techniques · Domain Adaptation and Few-Shot Learning
