Distributed Data Storage with Minimum Storage Regenerating Codes - Exact and Functional Repair are Asymptotically Equally Efficient
Viveck R. Cadambe, Syed A. Jafar, Hamed Maleki

TL;DR
This paper demonstrates that for distributed storage systems using MSR codes, the bandwidth required for exact node repair asymptotically matches that of functional repair across all parameters, including previously unresolved cases.
Contribution
It introduces an interference alignment scheme to achieve optimal repair bandwidth for all (n,k) cases, resolving open problems in low-redundancy regimes.
Findings
Exact and functional repair bandwidths are asymptotically equal.
The scheme works for all (n,k) configurations, including k > max(n/2,3).
Bandwidth ratio approaches (n-1)/(k(n-k)) as file size grows large.
Abstract
We consider a set up where a file of size M is stored in n distributed storage nodes, using an (n,k) minimum storage regenerating (MSR) code, i.e., a maximum distance separable (MDS) code that also allows efficient exact-repair of any failed node. The problem of interest in this paper is to minimize the repair bandwidth B for exact regeneration of a single failed node, i.e., the minimum data to be downloaded by a new node to replace the failed node by its exact replica. Previous work has shown that a bandwidth of B=[M(n-1)]/[k(n-k)] is necessary and sufficient for functional (not exact) regeneration. It has also been shown that if k < = max(n/2, 3), then there is no extra cost of exact regeneration over functional regeneration. The practically relevant setting of low-redundancy, i.e., k/n>1/2 remains open for k>3 and it has been shown that there is an extra bandwidth cost for exact…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Data Storage Technologies · Caching and Content Delivery · Cooperative Communication and Network Coding
