Block-Diagonal and LT Codes for Distributed Computing With Straggling Servers
Albin Severinson, Alexandre Graell i Amat, Eirik Rosnes

TL;DR
This paper introduces two novel coded schemes for distributed matrix-vector multiplication, improving computational delay and performance under deadlines by using partitioned MDS codes and LT codes, outperforming existing methods.
Contribution
The paper presents a partitioned MDS code scheme with lower overall delay and an LT code-based scheme that reduces delay further, both tailored for distributed computing with straggling servers.
Findings
Partitioned MDS codes match existing schemes in communication and delay but lower total delay.
LT codes reduce delay at the cost of increased communication load.
Proposed schemes outperform existing methods under deadline constraints.
Abstract
We propose two coded schemes for the distributed computing problem of multiplying a matrix by a set of vectors. The first scheme is based on partitioning the matrix into submatrices and applying maximum distance separable (MDS) codes to each submatrix. For this scheme, we prove that up to a given number of partitions the communication load and the computational delay (not including the encoding and decoding delay) are identical to those of the scheme recently proposed by Li et al., based on a single, long MDS code. However, due to the use of shorter MDS codes, our scheme yields a significantly lower overall computational delay when the delay incurred by encoding and decoding is also considered. We further propose a second coded scheme based on Luby Transform (LT) codes under inactivation decoding. Interestingly, LT codes may reduce the delay over the partitioned scheme at the expense of…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
