MDS Codes with Progressive Engagement Property for Cloud Storage Systems
Mahdi Hajiaghayi, Hamid Jafarkhani

TL;DR
This paper introduces a new class of MDS codes called rotation codes with the progressive engagement property, enabling flexible and efficient failure recovery in cloud storage by incrementally involving surviving nodes based on their access costs.
Contribution
The paper proposes rotation codes that combine MDS and progressive engagement properties, addressing limitations of existing codes for flexible, cost-effective failure recovery in cloud storage systems.
Findings
Rotation codes achieve near-optimal repair bandwidth with varying participating nodes.
Simulation shows improved recovery speed and bandwidth efficiency.
Permutation codes can be adapted for progressive engagement with modifications.
Abstract
Fast and efficient failure recovery is a new challenge for cloud storage systems with a large number of storage nodes. A pivotal recovery metric upon the failure of a storage node is repair bandwidth cost which refers to the amount of data that must be downloaded for regenerating the lost data. Since all the surviving nodes are not always accessible, we intend to introduce a class of maximum distance separable (MDS) codes that can be re-used when the number of selected nodes varies yet yields close to optimal repair bandwidth. Such codes provide flexibility in engaging more surviving nodes in favor of reducing the repair bandwidth without redesigning the code structure and changing the content of the existing nodes. We call this property of MDS codes progressive engagement. This name comes from the fact that if a failure occurs, it is shown that the best strategy is to incrementally…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Data Storage Technologies · Distributed systems and fault tolerance · Caching and Content Delivery
