Federated Optimization in Heterogeneous Networks

Tian Li; Anit Kumar Sahu; Manzil Zaheer; Maziar Sanjabi; Ameet; Talwalkar; Virginia Smith

arXiv:1812.06127·cs.LG·April 23, 2020·452 cites

Federated Optimization in Heterogeneous Networks

Tian Li, Anit Kumar Sahu, Manzil Zaheer, Maziar Sanjabi, Ameet, Talwalkar, Virginia Smith

PDF

Open Access 5 Repos

TL;DR

This paper introduces FedProx, a framework for federated learning that addresses both systems and statistical heterogeneity, providing convergence guarantees and improved robustness over FedAvg in diverse, real-world datasets.

Contribution

FedProx generalizes FedAvg to better handle heterogeneity, with theoretical convergence guarantees and practical improvements in stability and accuracy.

Findings

01

FedProx converges more reliably than FedAvg in heterogeneous settings.

02

FedProx improves test accuracy by 22% on average in diverse datasets.

03

The framework accommodates variable work across devices, enhancing robustness.

Abstract

Federated Learning is a distributed learning paradigm with two key challenges that differentiate it from traditional distributed optimization: (1) significant variability in terms of the systems characteristics on each device in the network (systems heterogeneity), and (2) non-identically distributed data across the network (statistical heterogeneity). In this work, we introduce a framework, FedProx, to tackle heterogeneity in federated networks. FedProx can be viewed as a generalization and re-parametrization of FedAvg, the current state-of-the-art method for federated learning. While this re-parameterization makes only minor modifications to the method itself, these modifications have important ramifications both in theory and in practice. Theoretically, we provide convergence guarantees for our framework when learning over data from non-identical distributions (statistical…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsPrivacy-Preserving Technologies in Data · Stochastic Gradient Optimization Techniques · Advanced MIMO Systems Optimization

MethodsProximity Regularization