Federated Optimization in Heterogeneous Networks
Tian Li, Anit Kumar Sahu, Manzil Zaheer, Maziar Sanjabi, Ameet, Talwalkar, Virginia Smith

TL;DR
This paper introduces FedProx, a framework for federated learning that addresses both systems and statistical heterogeneity, providing convergence guarantees and improved robustness over FedAvg in diverse, real-world datasets.
Contribution
FedProx generalizes FedAvg to better handle heterogeneity, with theoretical convergence guarantees and practical improvements in stability and accuracy.
Findings
FedProx converges more reliably than FedAvg in heterogeneous settings.
FedProx improves test accuracy by 22% on average in diverse datasets.
The framework accommodates variable work across devices, enhancing robustness.
Abstract
Federated Learning is a distributed learning paradigm with two key challenges that differentiate it from traditional distributed optimization: (1) significant variability in terms of the systems characteristics on each device in the network (systems heterogeneity), and (2) non-identically distributed data across the network (statistical heterogeneity). In this work, we introduce a framework, FedProx, to tackle heterogeneity in federated networks. FedProx can be viewed as a generalization and re-parametrization of FedAvg, the current state-of-the-art method for federated learning. While this re-parameterization makes only minor modifications to the method itself, these modifications have important ramifications both in theory and in practice. Theoretically, we provide convergence guarantees for our framework when learning over data from non-identical distributions (statistical…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsPrivacy-Preserving Technologies in Data · Stochastic Gradient Optimization Techniques · Advanced MIMO Systems Optimization
MethodsProximity Regularization
