Hierarchical Coded Matrix Multiplication
Shahrzad Kiani, Nuwan Ferdinand, Stark C. Draper

TL;DR
This paper introduces a hierarchical coding approach for distributed matrix multiplication that leverages work done by slower nodes, resulting in significant performance improvements over traditional methods that ignore stragglers.
Contribution
It proposes a novel hierarchical coding scheme that exploits partial work from stragglers, improving distributed matrix multiplication efficiency in heterogeneous environments.
Findings
Achieves 60% reduction in expected finishing time in simulations.
Realizes 35% speedup on Amazon EC2 cloud.
Outperforms existing straggler-resistant methods by utilizing partial computations.
Abstract
Slow working nodes, known as stragglers, can greatly reduce the speed of distributed computation. Coded matrix multiplication is a recently introduced technique that enables straggler-resistant distributed multiplication of large matrices. A key property is that the finishing time depends only on the work completed by a set of the fastest workers, while the work done by the slowest workers is ignored completely. This paper is motivated by the observation that in real-world commercial cloud computing systems such as Amazon's Elastic Compute Cloud (EC2) the distinction between fast and slow nodes is often a soft one. Thus, if we could also exploit the work completed by stragglers we may realize substantial performance gains. To realize such gains, in this paper we use the idea of hierarchical coding (Ferdinand and Draper, IEEE Int. Symp. Inf. Theory, 2018). We decompose the overall matrix…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsStochastic Gradient Optimization Techniques · Sparse and Compressive Sensing Techniques · Privacy-Preserving Technologies in Data
