FedPD: A Federated Learning Framework with Optimal Rates and Adaptivity to Non-IID Data
Xinwei Zhang, Mingyi Hong, Sairaj Dhople, Wotao Yin, Yang Liu

TL;DR
This paper introduces FedPD, a federated learning framework that achieves optimal convergence rates and adaptivity to non-IID data, overcoming limitations of existing algorithms like FedAvg, especially for non-convex problems.
Contribution
Proposes a primal-dual based federated learning framework that guarantees fast convergence, minimal assumptions, and adaptivity to data heterogeneity, a first in the field.
Findings
Handles non-convex objectives effectively
Achieves optimal communication and optimization complexity
Adapts communication patterns based on data heterogeneity
Abstract
Federated Learning (FL) has become a popular paradigm for learning from distributed data. To effectively utilize data at different devices without moving them to the cloud, algorithms such as the Federated Averaging (FedAvg) have adopted a "computation then aggregation" (CTA) model, in which multiple local updates are performed using local data, before sending the local models to the cloud for aggregation. However, these schemes typically require strong assumptions, such as the local data are identically independent distributed (i.i.d), or the size of the local gradients are bounded. In this paper, we first explicitly characterize the behavior of the FedAvg algorithm, and show that without strong and unrealistic assumptions on the problem structure, the algorithm can behave erratically for non-convex problems (e.g., diverge to infinity). Aiming at designing FL algorithms that are…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsStochastic Gradient Optimization Techniques · Privacy-Preserving Technologies in Data · Sparse and Compressive Sensing Techniques
