LDPC Code Design for Distributed Storage: Balancing Repair Bandwidth, Reliability and Storage Overhead
Hyegyeong Park, Dongwon Lee, Jaekyun Moon

TL;DR
This paper demonstrates that carefully designed LDPC codes can significantly reduce repair bandwidth and enhance reliability in distributed storage systems by optimizing the factor graph structure.
Contribution
It introduces a formula for repair bandwidth, shows how to design LDPC codes with minimal repair bandwidth, and compares their reliability to Reed-Solomon codes.
Findings
Regular check node degree minimizes repair bandwidth.
Regular variable node degree further reduces repair bandwidth.
LDPC codes can outperform Reed-Solomon codes in reliability.
Abstract
Distributed storage systems suffer from significant repair traffic generated due to frequent storage node failures. This paper shows that properly designed low-density parity-check (LDPC) codes can substantially reduce the amount of required block downloads for repair thanks to the sparse nature of their factor graph representation. In particular, with a careful construction of the factor graph, both low repair-bandwidth and high reliability can be achieved for a given code rate. First, a formula for the average repair bandwidth of LDPC codes is developed. This formula is then used to establish that the minimum repair bandwidth can be achieved by forcing a regular check node degree in the factor graph. Moreover, it is shown that given a fixed code rate, the variable node degree should also be regular to yield minimum repair bandwidth, under some reasonable minimum variable node degree…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
