Biased Compression in Gradient Coding for Distributed Learning
Chengxi Li, Ming Xiao, Mikael Skoglund

TL;DR
This paper introduces COCO-EF, a novel distributed learning method combining biased gradient compression with coding techniques, demonstrating improved efficiency and convergence guarantees over existing approaches.
Contribution
It is among the first to rigorously show the benefits of biased compression in gradient coding for distributed learning.
Findings
COCO-EF effectively mitigates stragglers and reduces communication costs.
Theoretical convergence guarantees are established for COCO-EF.
Empirical results show superior learning performance compared to baseline methods.
Abstract
Communication bottlenecks and the presence of stragglers pose significant challenges in distributed learning (DL). To deal with these challenges, recent advances leverage unbiased compression functions and gradient coding. However, the significant benefits of biased compression remain largely unexplored. To close this gap, we propose Compressed Gradient Coding with Error Feedback (COCO-EF), a novel DL method that combines gradient coding with biased compression to mitigate straggler effects and reduce communication costs. In each iteration, non-straggler devices encode local gradients from redundantly allocated training data, incorporate prior compression errors, and compress the results using biased compression functions before transmission. The server aggregates these compressed messages from the non-stragglers to approximate the global gradient for model updates. We provide rigorous…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsStochastic Gradient Optimization Techniques · Privacy-Preserving Technologies in Data · Sparse and Compressive Sensing Techniques
