Straggler Mitigation in Distributed Optimization Through Data Encoding

Can Karakus; Yifan Sun; Suhas Diggavi; Wotao Yin

arXiv:1711.04969·stat.ML·January 24, 2018·63 cites

Straggler Mitigation in Distributed Optimization Through Data Encoding

Can Karakus, Yifan Sun, Suhas Diggavi, Wotao Yin

PDF

Open Access

TL;DR

This paper introduces a novel data encoding approach for distributed optimization that mitigates stragglers by embedding redundancy directly in data, enabling robust convergence without waiting for slow nodes.

Contribution

It proposes data encoding schemes for distributed algorithms that allow convergence despite node variability, differing from prior coding-based methods that embed redundancy in computation.

Findings

01

Achieves linear convergence to an approximate solution with arbitrary node subsets.

02

Redundancy level controls the approximation accuracy.

03

Experimental results show advantages over uncoded and replication methods.

Abstract

Slow running or straggler tasks can significantly reduce computation speed in distributed computation. Recently, coding-theory-inspired approaches have been applied to mitigate the effect of straggling, through embedding redundancy in certain linear computational steps of the optimization algorithm, thus completing the computation without waiting for the stragglers. In this paper, we propose an alternate approach where we embed the redundancy directly in the data itself, and allow the computation to proceed completely oblivious to encoding. We propose several encoding schemes, and demonstrate that popular batch algorithms, such as gradient descent and L-BFGS, applied in a coding-oblivious manner, deterministically achieve sample path linear convergence to an approximate solution of the original problem, using an arbitrarily varying subset of the nodes at each iteration. Moreover, this…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsSparse and Compressive Sensing Techniques · Stochastic Gradient Optimization Techniques · Machine Learning and ELM

MethodsSPEED: Separable Pyramidal Pooling EncodEr-Decoder for Real-Time Monocular Depth Estimation on Low-Resource Settings