Evaluating Load Balancing Performance in Distributed Storage with Redundancy
Mehmet Fatih Aktas, Amir Behrouzi-Far, Emina Soljanin, Philip Whiting

TL;DR
This paper analyzes how redundancy level in distributed storage affects load balancing, showing that increasing redundancy improves load distribution exponentially up to a logarithmic scale, with implications for storage overhead and recovery.
Contribution
It provides a theoretical analysis of load balancing performance in redundant distributed storage systems, revealing how redundancy level impacts load distribution and system efficiency.
Findings
Load balance improves multiplicatively with redundancy d when d = o(log n).
Load balance improves exponentially with d when d = Θ(log n).
Redundancy reduces storage overhead multiplicatively but increases recovery load linearly.
Abstract
To facilitate load balancing, distributed systems store data redundantly. We evaluate the load balancing performance of storage schemes in which each object is stored at different nodes, and each node stores the same number of objects. In our model, the load offered for the objects is sampled uniformly at random from all the load vectors with a fixed cumulative value. We find that the load balance in a system of nodes improves multiplicatively with as long as , and improves exponentially once . We show that the load balance improves in the same way with when the service choices are created with XOR's of objects rather than object replicas. In such redundancy schemes, storage overhead is reduced multiplicatively by . However, recovery of an object requires downloading content from nodes. At the same…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
