Federated Accelerated Stochastic Gradient Descent
Honglin Yuan, Tengyu Ma

TL;DR
This paper introduces FedAc, an accelerated federated learning algorithm that improves convergence speed and reduces communication rounds compared to FedAvg, especially for convex functions, through novel stability and potential-based analysis.
Contribution
FedAc is the first provably accelerated version of FedAvg, achieving faster convergence and fewer communication rounds for convex optimization in federated learning.
Findings
FedAc achieves linear speedup with fewer rounds of communication.
FedAc outperforms FedAvg in convergence speed on convex functions.
Stronger guarantees for third-order smooth objectives.
Abstract
We propose Federated Accelerated Stochastic Gradient Descent (FedAc), a principled acceleration of Federated Averaging (FedAvg, also known as Local SGD) for distributed optimization. FedAc is the first provable acceleration of FedAvg that improves convergence speed and communication efficiency on various types of convex functions. For example, for strongly convex and smooth functions, when using workers, the previous state-of-the-art FedAvg analysis can achieve a linear speedup in if given rounds of synchronization, whereas FedAc only requires rounds. Moreover, we prove stronger guarantees for FedAc when the objectives are third-order smooth. Our technique is based on a potential-based perturbed iterate analysis, a novel stability analysis of generalized accelerated SGD, and a strategic tradeoff between acceleration and stability.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Taxonomy
TopicsStochastic Gradient Optimization Techniques · Privacy-Preserving Technologies in Data · Advanced Bandit Algorithms Research
MethodsSPEED: Separable Pyramidal Pooling EncodEr-Decoder for Real-Time Monocular Depth Estimation on Low-Resource Settings · Stochastic Gradient Descent
