The Storage vs Repair Bandwidth Trade-off for Multiple Failures in Clustered Storage Networks
Vitaly Abdrashitov, N. Prakash, Muriel Médard

TL;DR
This paper studies the trade-off between storage overhead and inter-cluster repair bandwidth in clustered storage systems when multiple failed nodes within a cluster are repaired jointly, and derives bounds that show when joint repair helps.
Contribution
It characterizes the optimal storage vs. repair-bandwidth trade-off for multiple failures in clustered storage: over the whole trade-off under functional repair, and at the minimum storage and minimum inter-cluster bandwidth (MBR) operating points under exact repair.
Findings
The trade-off is the same as for a single failure when $t$ divides $(m-\ell)$
At the MBR point, the optimal file size under exact repair can be smaller than under functional repair
More local helpers do not always increase capacity under functional repair
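
The divisibility condition in the first finding can be checked mechanically. The sketch below is only an illustration, not code from the paper: it assumes $m$ is the number of nodes per cluster, $\ell$ the number of local helper nodes, and $t$ the number of failed nodes repaired together, and it assumes the helpers are surviving nodes, so $\ell + t \le m$. The function name and the returned labels are hypothetical.

```python
def joint_repair_regime(m: int, ell: int, t: int) -> str:
    """Classify (m, ell, t) by whether t divides (m - ell).

    Assumed meanings (not confirmed by the summary text):
      m   - nodes per cluster
      ell - local helper nodes used during repair
      t   - failed nodes repaired jointly
    """
    # Assumption: local helpers are surviving nodes, so ell + t <= m.
    if m < 1 or ell < 0 or t < 1 or ell + t > m:
        raise ValueError("need m >= 1, ell >= 0, t >= 1 and ell + t <= m")

    if (m - ell) % t == 0:
        # Finding 1: the trade-off matches the single-failure case.
        return "same-as-single-failure"
    # Otherwise the trade-off is not covered by Finding 1.
    return "not-covered-by-finding-1"


if __name__ == "__main__":
    for params in [(5, 1, 2), (5, 2, 2), (6, 0, 3)]:
        print(params, joint_repair_regime(*params))
```

The function only reports which side of the condition a parameter set falls on; it does not compute file sizes or bandwidth, since the summary gives no closed-form bounds.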
Abstract
We study the trade-off between storage overhead and inter-cluster repair bandwidth in clustered storage systems, while recovering from multiple node failures within a cluster. A cluster is a collection of $m$ nodes, and there are $n$ clusters. For data collection, we download the entire content from any $k$ clusters. For repair of $t \geq 2$ nodes within a cluster, we take help from $\ell$ local nodes, as well as $d$ helper clusters. We characterize the optimal trade-off under functional repair, and also under exact repair for the minimum storage and minimum inter-cluster bandwidth (MBR) operating points. Our bounds show the following interesting facts: 1) When $t \mid (m-\ell)$, the trade-off is the same as that under $t=1$, and thus there is no advantage in jointly repairing multiple nodes. 2) When $t \nmid (m-\ell)$, the optimal file-size at the MBR point under exact repair can be…
