Bandwidth Cost of Code Conversions in the Split Regime
Francisco Maturana, K. V. Rashmi

TL;DR
This paper analyzes the bandwidth required for code conversions in distributed storage, deriving bounds and proposing optimal constructions for the split regime where data is split into multiple codewords.
Contribution
It introduces lower bounds on conversion bandwidth and presents constructions that minimize data transfer during code conversions in the split regime.
Findings
Derived lower bounds on conversion bandwidth.
Proposed bandwidth-efficient code conversion constructions.
Identified optimal solutions for specific parameters.
Abstract
Distributed storage systems must store large amounts of data over long periods of time. To avoid data loss due to device failures, an erasure code is used to encode data symbols into a codeword of symbols that are stored across different devices. However, device failure rates change throughout the life of the data, and tuning and according to these changes has been shown to save significant storage space. Code conversion is the process of converting multiple codewords of an initial code into codewords of a final code that decode to the same set of data symbols. In this paper, we study conversion bandwidth, defined as the total amount of data transferred between nodes during conversion. In particular, we consider the case where the initial and final codes are MDS and a single initial codeword is split into several final codewords…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Data Storage Technologies · Caching and Content Delivery · Distributed systems and fault tolerance
