Communication Compression for Decentralized Training
Hanlin Tang, Shaoduo Gan, Ce Zhang, Tong Zhang, Ji Liu

TL;DR
This paper introduces a framework for communication compression in decentralized training, combining techniques to handle both low bandwidth and high latency networks, and proposes algorithms with proven convergence rates.
Contribution
It develops a novel framework and algorithms for compressed decentralized training that effectively address both bandwidth and latency challenges, with theoretical convergence guarantees.
Findings
Algorithms converge at rate O(1/√nT) matching centralized training.
Proposed methods outperform existing decentralized and quantized algorithms.
Effective in networks with both high latency and low bandwidth.
Abstract
Optimizing distributed learning systems is an art of balancing between computation and communication. There have been two lines of research that try to deal with slower networks: {\em communication compression} for low bandwidth networks, and {\em decentralization} for high latency networks. In this paper, We explore a natural question: {\em can the combination of both techniques lead to a system that is robust to both bandwidth and latency?} Although the system implication of such combination is trivial, the underlying theoretical principle and algorithm design is challenging: unlike centralized algorithms, simply compressing exchanged information, even in an unbiased stochastic way, within the decentralized network would accumulate the error and fail to converge. In this paper, we develop a framework of compressed, decentralized training and propose two different strategies, which…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsStochastic Gradient Optimization Techniques · Age of Information Optimization · Privacy-Preserving Technologies in Data
