Optimization-based Block Coordinate Gradient Coding
Qi Wang, Ying Cui, Chenglin Li, Junni Zou, Hongkai Xiong

TL;DR
This paper introduces an optimal gradient coding scheme with diverse redundancies across gradient coordinates, improving distributed machine learning efficiency by minimizing expected runtime under partial straggler models.
Contribution
It proposes a novel block coordinate gradient coding method that optimizes redundancy distribution across gradient coordinates, a first in this research area.
Findings
Optimal redundancy levels for different straggler tolerances identified.
Two low-complexity approximate solutions with closed-form expressions developed.
Expected runtime gaps between solutions and optimal are sub-linear in the number of workers.
Abstract
Existing gradient coding schemes introduce identical redundancy across the coordinates of gradients and hence cannot fully utilize the computation results from partial stragglers. This motivates the introduction of diverse redundancies across the coordinates of gradients. This paper considers a distributed computation system consisting of one master and workers characterized by a general partial straggler model and focuses on solving a general large-scale machine learning problem with model parameters. We show that it is sufficient to provide at most levels of redundancies for tolerating stragglers, respectively. Consequently, we propose an optimal block coordinate gradient coding scheme based on a stochastic optimization problem that optimizes the partition of the coordinates into blocks, each with identical redundancy, to minimize the expected…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsStochastic Gradient Optimization Techniques · Privacy-Preserving Technologies in Data · Complexity and Algorithms in Graphs
