On Convergence of FedProx: Local Dissimilarity Invariant Bounds, Non-smoothness and Beyond
Xiao-Tong Yuan, Ping Li

TL;DR
This paper develops a new convergence theory for FedProx in federated learning that is invariant to local dissimilarity and applicable to non-smooth problems, providing deeper insights and practical benefits.
Contribution
It introduces a local dissimilarity invariant convergence analysis for FedProx, extending understanding to non-smooth and non-convex federated optimization.
Findings
Convergence guarantees independent of local dissimilarity conditions
Applicable to non-smooth federated learning problems
Linear speedup with minibatch size and number of devices
Abstract
The FedProx algorithm is a simple yet powerful distributed proximal point optimization method widely used for federated learning (FL) over heterogeneous data. Despite its popularity and remarkable success witnessed in practice, the theoretical understanding of FedProx is largely underinvestigated: the appealing convergence behavior of FedProx is so far characterized under certain non-standard and unrealistic dissimilarity assumptions of local functions, and the results are limited to smooth optimization problems. In order to remedy these deficiencies, we develop a novel local dissimilarity invariant convergence theory for FedProx and its minibatch stochastic extension through the lens of algorithmic stability. As a result, we contribute to derive several new and deeper insights into FedProx for non-convex federated optimization including: 1) convergence guarantees independent on local…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Taxonomy
TopicsStochastic Gradient Optimization Techniques · Privacy-Preserving Technologies in Data · Sparse and Compressive Sensing Techniques
