Bandwidth Cost of Code Conversions in the Split Regime

Francisco Maturana; K. V. Rashmi

arXiv:2205.06793·cs.IT·May 16, 2022

Bandwidth Cost of Code Conversions in the Split Regime

Francisco Maturana, K. V. Rashmi

PDF

Open Access

TL;DR

This paper analyzes the bandwidth required for code conversions in distributed storage, deriving bounds and proposing optimal constructions for the split regime where data is split into multiple codewords.

Contribution

It introduces lower bounds on conversion bandwidth and presents constructions that minimize data transfer during code conversions in the split regime.

Findings

01

Derived lower bounds on conversion bandwidth.

02

Proposed bandwidth-efficient code conversion constructions.

03

Identified optimal solutions for specific parameters.

Abstract

Distributed storage systems must store large amounts of data over long periods of time. To avoid data loss due to device failures, an $[n, k]$ erasure code is used to encode $k$ data symbols into a codeword of $n$ symbols that are stored across different devices. However, device failure rates change throughout the life of the data, and tuning $n$ and $k$ according to these changes has been shown to save significant storage space. Code conversion is the process of converting multiple codewords of an initial $[n^{I}, k^{I}]$ code into codewords of a final $[n^{F}, k^{F}]$ code that decode to the same set of data symbols. In this paper, we study conversion bandwidth, defined as the total amount of data transferred between nodes during conversion. In particular, we consider the case where the initial and final codes are MDS and a single initial codeword is split into several final codewords…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Data Storage Technologies · Caching and Content Delivery · Distributed systems and fault tolerance