Coded Data Rebalancing for Distributed Data Storage Systems with Cyclic Storage
Abhinav Vaishya, Athreya Chandramouli, Srikar Kale, Prasad Krishnan

TL;DR
This paper introduces efficient coded rebalancing schemes for cyclic balanced distributed storage systems, significantly reducing communication load during node changes with manageable file sizes, compared to previous exponential-size schemes.
Contribution
It presents novel coded rebalancing schemes for cyclic balanced databases with cubic file-size requirements, improving over prior exponential schemes, and provides bounds showing near-optimal performance.
Findings
Coded schemes reduce communication load during node removal.
Uncoded scheme is optimal for node addition.
Achievable loads are within a multiplicative gap of the lower bound.
Abstract
We consider replication-based distributed storage systems in which each node stores the same quantum of data and each data bit stored has the same replication factor across the nodes. Such systems are referred to as balanced distributed databases. When existing nodes leave or new nodes are added to this system, the balanced nature of the database is lost, either due to the reduction in the replication factor, or the non-uniformity of the storage at the nodes. This triggers a rebalancing algorithm, that exchanges data between the nodes so that the balance of the database is reinstated. The goal is then to design rebalancing schemes with minimal communication load. In a recent work by Krishnan et al., coded transmissions were used to rebalance a carefully designed distributed database from a node removal or addition. These coded rebalancing schemes have optimal communication load,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Data Storage Technologies · Cooperative Communication and Network Coding · Distributed systems and fault tolerance
