Communication-Efficient Approximate Gradient Coding
Sifat Munim, Aditya Ramamoorthy

TL;DR
This paper introduces communication-efficient approximate gradient coding schemes for distributed learning, using structured matrices and probabilistic analysis to ensure convergence despite worker failures and communication constraints.
Contribution
It systematically develops approximate gradient coding schemes with tight error bounds, extending the existing exact recovery methods to more communication-efficient and resilient solutions.
Findings
Proposed structured matrix-based gradient coding schemes with bounded approximation error.
Derived tight analytical bounds and lower bounds on the approximation error.
Proved convergence of the learning algorithm using the proposed schemes under probabilistic failure models.
Abstract
Large-scale distributed learning aims at minimizing a loss function that depends on a training dataset with respect to a -length parameter vector. The distributed cluster typically consists of a parameter server (PS) and multiple workers. Gradient coding is a technique that makes the learning process resilient to straggling workers. It introduces redundancy within the assignment of data points to the workers and uses coding theoretic ideas so that the PS can recover exactly or approximately, even in the presence of stragglers. Communication-efficient gradient coding allows the workers to communicate vectors of length smaller than to the PS, thus reducing the communication time. While there have been schemes that address the exact recovery of within communication-efficient gradient coding, to the best of our knowledge the approximate variant has not been…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsStochastic Gradient Optimization Techniques · Privacy-Preserving Technologies in Data · Complexity and Algorithms in Graphs
