Biased Compression in Gradient Coding for Distributed Learning

Chengxi Li; Ming Xiao; Mikael Skoglund

arXiv:2603.16353·cs.DC·March 18, 2026·IEEE Trans. Signal Process.

Biased Compression in Gradient Coding for Distributed Learning

Chengxi Li, Ming Xiao, Mikael Skoglund

PDF

Open Access

TL;DR

This paper introduces COCO-EF, a novel distributed learning method combining biased gradient compression with coding techniques, demonstrating improved efficiency and convergence guarantees over existing approaches.

Contribution

It is among the first to rigorously show the benefits of biased compression in gradient coding for distributed learning.

Findings

01

COCO-EF effectively mitigates stragglers and reduces communication costs.

02

Theoretical convergence guarantees are established for COCO-EF.

03

Empirical results show superior learning performance compared to baseline methods.

Abstract

Communication bottlenecks and the presence of stragglers pose significant challenges in distributed learning (DL). To deal with these challenges, recent advances leverage unbiased compression functions and gradient coding. However, the significant benefits of biased compression remain largely unexplored. To close this gap, we propose Compressed Gradient Coding with Error Feedback (COCO-EF), a novel DL method that combines gradient coding with biased compression to mitigate straggler effects and reduce communication costs. In each iteration, non-straggler devices encode local gradients from redundantly allocated training data, incorporate prior compression errors, and compress the results using biased compression functions before transmission. The server aggregates these compressed messages from the non-stragglers to approximate the global gradient for model updates. We provide rigorous…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsStochastic Gradient Optimization Techniques · Privacy-Preserving Technologies in Data · Sparse and Compressive Sensing Techniques