Server-Side Stepsizes and Sampling Without Replacement Provably Help in Federated Optimization
Grigory Malinovsky, Konstantin Mishchenko, Peter Richt\'arik

TL;DR
This paper provides a theoretical analysis showing that server-side stepsizes and sampling without replacement significantly improve federated learning efficiency, especially with Random Reshuffling and adaptive stepsizes.
Contribution
It introduces the first provable benefits of server-side stepsizes and sampling without replacement in federated averaging, enhancing convergence rates for various objectives.
Findings
Improved convergence rates from O(ε^{-3}) to O(ε^{-2}) in non-convex settings.
Large server-side stepsizes help overcome communication bottlenecks.
Random Reshuffling over clients improves optimization complexities.
Abstract
We present a theoretical study of server-side optimization in federated learning. Our results are the first to show that the widely popular heuristic of scaling the client updates with an extra parameter is very useful in the context of Federated Averaging (FedAvg) with local passes over the client data. Each local pass is performed without replacement using Random Reshuffling, which is a key reason we can show improved complexities. In particular, we prove that whenever the local stepsizes are small, and the update direction is given by FedAvg in conjunction with Random Reshuffling over all clients, one can take a big leap in the obtained direction and improve rates for convex, strongly convex, and non-convex objectives. In particular, in non-convex regime we get an enhancement of the rate of convergence from to…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsStochastic Gradient Optimization Techniques · Privacy-Preserving Technologies in Data · Cooperative Communication and Network Coding
