Block-Diagonal and LT Codes for Distributed Computing With Straggling   Servers

Albin Severinson; Alexandre Graell i Amat; Eirik Rosnes

arXiv:1712.08230·cs.IT·October 22, 2018

Block-Diagonal and LT Codes for Distributed Computing With Straggling Servers

Albin Severinson, Alexandre Graell i Amat, Eirik Rosnes

PDF

TL;DR

This paper introduces two novel coded schemes for distributed matrix-vector multiplication, improving computational delay and performance under deadlines by using partitioned MDS codes and LT codes, outperforming existing methods.

Contribution

The paper presents a partitioned MDS code scheme with lower overall delay and an LT code-based scheme that reduces delay further, both tailored for distributed computing with straggling servers.

Findings

01

Partitioned MDS codes match existing schemes in communication and delay but lower total delay.

02

LT codes reduce delay at the cost of increased communication load.

03

Proposed schemes outperform existing methods under deadline constraints.

Abstract

We propose two coded schemes for the distributed computing problem of multiplying a matrix by a set of vectors. The first scheme is based on partitioning the matrix into submatrices and applying maximum distance separable (MDS) codes to each submatrix. For this scheme, we prove that up to a given number of partitions the communication load and the computational delay (not including the encoding and decoding delay) are identical to those of the scheme recently proposed by Li et al., based on a single, long MDS code. However, due to the use of shorter MDS codes, our scheme yields a significantly lower overall computational delay when the delay incurred by encoding and decoding is also considered. We further propose a second coded scheme based on Luby Transform (LT) codes under inactivation decoding. Interestingly, LT codes may reduce the delay over the partitioned scheme at the expense of…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.