In-Network Redundancy Generation for Opportunistic Speedup of Backup
Lluis Pamies-Juarez, Anwitaman Datta, Fr\'ed\'erique Oggier

TL;DR
This paper introduces an in-network redundancy generation method for distributed storage that improves throughput by enabling storage nodes to collaboratively generate redundancy, leveraging local repairability and asynchronous scheduling.
Contribution
It proposes a novel in-network redundancy generation approach using local repairable erasure codes, with algorithms to optimize resource utilization and significantly boost storage throughput.
Findings
Up to 90% throughput increase in data centers
Up to 60% throughput increase in peer-to-peer systems
Effective resource scheduling enhances redundancy generation efficiency
Abstract
Erasure coding is a storage-efficient alternative to replication for achieving reliable data backup in distributed storage systems. During the storage process, traditional erasure codes require a unique source node to create and upload all the redundant data to the different storage nodes. However, such a source node may have limited communication and computation capabilities, which constrain the storage process throughput. Moreover, the source node and the different storage nodes might not be able to send and receive data simultaneously -- e.g., nodes might be busy in a datacenter setting, or simply be offline in a peer-to-peer setting -- which can further threaten the efficacy of the overall storage process. In this paper we propose an "in-network" redundancy generation process which distributes the data insertion load among the source and storage nodes by allowing the storage nodes…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
