HashTag Erasure Codes: From Theory to Practice
Katina Kralevska, Danilo Gligoroski, Rune E. Jensen, and Harald Øverby

TL;DR
HashTag Erasure Codes (HTECs) are a practical class of high-rate MDS codes that significantly reduce repair bandwidth and I/O operations in distributed storage, outperforming existing MSR codes, especially for multiple failures.
Contribution
This paper names and analyzes HashTag Erasure Codes (HTECs), a class of high-rate MDS codes simultaneously optimized for storage, reliability, I/O operations, and repair bandwidth, and reports on a practical Hadoop implementation.
Findings
HTECs achieve the lowest data-read and data-transfer during repair among MDS codes.
HTECs reduce repair bandwidth for both single and multiple failures of the systematic nodes.
Practical Hadoop implementation demonstrates high potential.
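To give a sense of scale for the repair-bandwidth findings above, the following is a minimal sketch comparing conventional Reed-Solomon repair with the general MSR lower bound from the regenerating-codes literature. It is not the HTEC-specific figure reported in the paper: the function names and the (n, k) values are illustrative assumptions, and a concrete code may sit above this bound depending on its sub-packetization.

```python
from fractions import Fraction


def rs_repair_fraction() -> Fraction:
    """Conventional RS repair of one node reads k full chunks,
    i.e. an amount of data equal to the whole stored file."""
    return Fraction(1)


def msr_repair_fraction(n: int, k: int) -> Fraction:
    """General MSR lower bound on single-node repair traffic, as a
    fraction of the file size, when all n - 1 surviving nodes help:
    (n - 1) / (k * (n - k)).
    Assumption: this is the generic bound, not an HTEC measurement."""
    if not 0 < k < n:
        raise ValueError("need 0 < k < n")
    return Fraction(n - 1, k * (n - k))


if __name__ == "__main__":
    # Hypothetical code parameters chosen only for illustration.
    for n, k in [(9, 6), (14, 10)]:
        bound = msr_repair_fraction(n, k)
        print(f"(n={n}, k={k}): RS reads {rs_repair_fraction()} of the file, "
              f"MSR bound is {bound} (about {float(bound):.3f})")
```

For a high-rate code the gap is large: with the assumed (14, 10) parameters the bound is 13/40 of the file, versus the full file for conventional RS repair.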
Abstract
Minimum-Storage Regenerating (MSR) codes have emerged as a viable alternative to Reed-Solomon (RS) codes as they minimize the repair bandwidth while they are still optimal in terms of reliability and storage overhead. Although several MSR constructions exist, so far they have not been practically implemented, mainly due to the large number of I/O operations. In this paper, we analyze high-rate MDS codes that are simultaneously optimized in terms of storage, reliability, I/O operations, and repair bandwidth for single and multiple failures of the systematic nodes. The codes were recently introduced in \cite{7463553} without any specific name. Due to the resemblance between the hashtag sign # and the procedure of the code construction, we call them in this paper HashTag Erasure Codes (HTECs). HTECs provide the lowest data-read and data-transfer, and thus the lowest repair time for an…
