Communication-Efficient Local Decentralized SGD Methods
Xiang Li, Wenhao Yang, Shusen Wang, Zhihua Zhang

TL;DR
This paper introduces LD-SGD, a flexible decentralized optimization algorithm that combines local updates and decentralized communication, providing convergence guarantees and insights for improving communication efficiency in non-convex, distributed training.
Contribution
It proposes LD-SGD, a novel algorithm with an analytical framework, and demonstrates how different update schemes enhance communication efficiency in decentralized settings.
Findings
LD-SGD converges to critical points under broad conditions.
Alternating local and global updates balances communication and computation.
Decaying local update length improves communication efficiency empirically.
Abstract
Recently, the technique of local updates is a powerful tool in centralized settings to improve communication efficiency via periodical communication. For decentralized settings, it is still unclear how to efficiently combine local updates and decentralized communication. In this work, we propose an algorithm named as LD-SGD, which incorporates arbitrary update schemes that alternate between multiple Local updates and multiple Decentralized SGDs, and provide an analytical framework for LD-SGD. Under the framework, we present a sufficient condition to guarantee the convergence. We show that LD-SGD converges to a critical point for a wide range of update schemes when the objective is non-convex and the training data are non-identically independent distributed. Moreover, our framework brings many insights into the design of update schemes for decentralized optimization. As examples, we…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsIndoor and Outdoor Localization Technologies · Robotics and Sensor-Based Localization · Sparse and Compressive Sensing Techniques
MethodsStochastic Gradient Descent
