Erasure coding for distributed matrix multiplication for matrices with   bounded entries

Li Tang; Konstantinos Konstantinidis; Aditya Ramamoorthy

arXiv:1811.02144·cs.DC·November 8, 2018·1 cites

Erasure coding for distributed matrix multiplication for matrices with bounded entries

Li Tang, Konstantinos Konstantinidis, Aditya Ramamoorthy

PDF

Open Access

TL;DR

This paper introduces a new erasure coding strategy for distributed matrix multiplication that leverages bounds on matrix entries to optimize the recovery threshold, reducing the impact of stragglers in distributed computations.

Contribution

It proposes a novel coding approach that exploits bounds on matrix entries to improve straggler mitigation in distributed matrix multiplication.

Findings

01

Tradeoff between entry bounds and recovery threshold demonstrated.

02

Method achieves optimal recovery threshold under certain bounds.

03

Experimental validation on cloud clusters confirms effectiveness.

Abstract

Distributed matrix multiplication is widely used in several scientific domains. It is well recognized that computation times on distributed clusters are often dominated by the slowest workers (called stragglers). Recent work has demonstrated that straggler mitigation can be viewed as a problem of designing erasure codes. For matrices $A$ and $B$ , the technique essentially maps the computation of $A^{T} B$ into the multiplication of smaller (coded) submatrices. The stragglers are treated as erasures in this process. The computation can be completed as long as a certain number of workers (called the recovery threshold) complete their assigned tasks. We present a novel coding strategy for this problem when the absolute values of the matrix entries are sufficiently small. We demonstrate a tradeoff between the assumed absolute value bounds on the matrix…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsStochastic Gradient Optimization Techniques · Privacy-Preserving Technologies in Data · Caching and Content Delivery