Optimal Systematic Distributed Storage Codes with Fast Encoding
Preetum Nakkiran, K.V. Rashmi, Kannan Ramchandran

TL;DR
This paper introduces a novel explicit erasure code construction for distributed storage that simultaneously achieves maximum reliability, minimal repair bandwidth, repair flexibility, systematic data storage, and fast encoding, addressing key practical needs.
Contribution
It provides the first explicit code construction that guarantees all five desirable properties simultaneously, building on and extending Product-Matrix codes.
Findings
Achieves all five properties simultaneously in a single code.
Enables fast encoding with about 4x speedup in practice.
Provides a framework for transforming existing codes for faster encoding.
Abstract
Erasure codes are being increasingly used in distributed-storage systems in place of data-replication, since they provide the same level of reliability with much lower storage overhead. We consider the problem of constructing explicit erasure codes for distributed storage with the following desirable properties motivated by practice: (i) Maximum-Distance-Separable (MDS): to provide maximal reliability at minimum storage overhead, (ii) Optimal repair-bandwidth: to minimize the amount of data needed to be transferred to repair a failed node from remaining ones, (iii) Flexibility in repair: to allow maximal flexibility in selecting subset of nodes to use for repair, which includes not requiring that all surviving nodes be used for repair, (iv) Systematic Form: to ensure that the original data exists in uncoded form, and (v) Fast encoding: to minimize the cost of generating encoded data…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
