Optimal Locally Repairable Codes and Connections to Matroid Theory

Itzhak Tamo; Dimitris S. Papailiopoulos; Alexandros G. Dimakis

arXiv:1301.7693·cs.IT·November 7, 2013·29 cites

Optimal Locally Repairable Codes and Connections to Matroid Theory

Itzhak Tamo, Dimitris S. Papailiopoulos, Alexandros G. Dimakis

PDF

Open Access

TL;DR

This paper introduces explicit, optimal locally repairable codes (LRCs) for distributed storage, leveraging Reed-Solomon codes and matroid theory to improve repair efficiency and approach fundamental bounds.

Contribution

It presents a simple, explicit construction of optimal LRCs based on Reed-Solomon codes and matroid analysis, filling a gap in existing code designs.

Findings

01

Constructed explicit optimal LRCs with low repair locality.

02

Derived a new matroid-based result for code generator matrices.

03

Demonstrated the practical relevance of LRCs in large-scale storage systems.

Abstract

Petabyte-scale distributed storage systems are currently transitioning to erasure codes to achieve higher storage efficiency. Classical codes like Reed-Solomon are highly sub-optimal for distributed environments due to their high overhead in single-failure events. Locally Repairable Codes (LRCs) form a new family of codes that are repair efficient. In particular, LRCs minimize the number of nodes participating in single node repairs during which they generate small network traffic. Two large-scale distributed storage systems have already implemented different types of LRCs: Windows Azure Storage and the Hadoop Distributed File System RAID used by Facebook. The fundamental bounds for LRCs, namely the best possible distance for a given code locality, were recently discovered, but few explicit constructions exist. In this work, we present an explicit and optimal LRCs that are simple to…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Data Storage Technologies · Caching and Content Delivery · Distributed systems and fault tolerance