Sparse-SignSGD with Majority Vote for Communication-Efficient Distributed Learning
Chanho Park, Namyoon Lee

TL;DR
This paper introduces ${ m S}^3$GD-MV, a communication-efficient distributed learning algorithm combining sparsification and sign quantization, achieving similar convergence rates to signSGD while reducing communication costs and improving accuracy.
Contribution
The paper proposes ${ m S}^3$GD-MV, a novel algorithm that integrates sparsification and majority vote for efficient distributed deep learning, with theoretical convergence guarantees and empirical improvements.
Findings
Converges at the same rate as signSGD under mild conditions.
Reduces communication costs significantly compared to signSGD.
Achieves higher accuracy on IID and non-IID datasets.
Abstract
The training efficiency of complex deep learning models can be significantly improved through the use of distributed optimization. However, this process is often hindered by a large amount of communication cost between workers and a parameter server during iterations. To address this bottleneck, in this paper, we present a new communication-efficient algorithm that offers the synergistic benefits of both sparsification and sign quantization, called GD-MV. The workers in GD-MV select the top- magnitude components of their local gradient vector and only send the signs of these components to the server. The server then aggregates the signs and returns the results via a majority vote rule. Our analysis shows that, under certain mild conditions, GD-MV can converge at the same rate as signSGD while significantly reducing communication costs, if the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsStochastic Gradient Optimization Techniques · Machine Learning and ELM · Privacy-Preserving Technologies in Data
