Network Coding for Distributed Storage Systems
Alexandros G. Dimakis, P. Brighten Godfrey, Yunnan Wu, Martin J., Wainwright, Kannan Ramchandran

TL;DR
This paper introduces regenerating codes for distributed storage systems, enabling efficient data repair by reducing bandwidth and balancing storage costs, supported by theoretical analysis and network coding constructions.
Contribution
It proposes the concept of regenerating codes, characterizes the fundamental tradeoff between storage and repair bandwidth, and provides explicit code constructions achieving optimal points.
Findings
Regenerating codes reduce repair bandwidth significantly.
A fundamental tradeoff between storage and repair bandwidth is established.
Constructive codes achieve optimal points on the tradeoff curve.
Abstract
Distributed storage systems provide reliable access to data through redundancy spread over individually unreliable nodes. Application scenarios include data centers, peer-to-peer storage systems, and storage in wireless networks. Storing data using an erasure code, in fragments spread across nodes, requires less redundancy than simple replication for the same level of reliability. However, since fragments must be periodically replaced as nodes fail, a key question is how to generate encoded fragments in a distributed way while transferring as little data as possible across the network. For an erasure coded system, a common practice to repair from a node failure is for a new node to download subsets of data stored at a number of surviving nodes, reconstruct a lost coded block using the downloaded data, and store it at the new node. We show that this procedure is sub-optimal. We…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsCooperative Communication and Network Coding · Caching and Content Delivery · Advanced Data Storage Technologies
