The Role of Local Steps in Local SGD
Tiancheng Qin, S. Rasoul Etesami, C\'esar A. Uribe

TL;DR
This paper analyzes how varying the number of local steps in Local SGD affects convergence and communication efficiency in distributed optimization, proposing strategies for different function types.
Contribution
It characterizes the convergence rate of Local SGD with arbitrary local steps and introduces a new increasing local steps strategy for strongly convex functions.
Findings
Increasing local steps can lead to linear speed-up for strongly convex functions.
Fixed local steps are optimal for convex and nonconvex functions.
Theoretical analysis is supported by extensive numerical experiments.
Abstract
We consider the distributed stochastic optimization problem where agents want to minimize a global function given by the sum of agents' local functions, and focus on the heterogeneous setting when agents' local functions are defined over non-i.i.d. data sets. We study the Local SGD method, where agents perform a number of local stochastic gradient steps and occasionally communicate with a central node to improve their local optimization tasks. We analyze the effect of local steps on the convergence rate and the communication complexity of Local SGD. In particular, instead of assuming a fixed number of local steps across all communication rounds, we allow the number of local steps during the -th communication round, , to be different and arbitrary numbers. Our main contribution is to characterize the convergence rate of Local SGD as a function of under…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsStochastic Gradient Optimization Techniques · Privacy-Preserving Technologies in Data · Sparse and Compressive Sensing Techniques
MethodsLocal SGD · Stochastic Gradient Descent
