Controlling Data Access Load in Distributed Systems

Mehmet Aktas; Emina Soljanin

arXiv:2312.10360 · cs.DC · December 19, 2023 · 1 citation

Open Access

TL;DR

This paper analyzes how storage redundancy levels and data object assignment strategies affect load balancing in distributed systems, providing theoretical bounds and insights for different storage schemes.

Contribution

It introduces a formal analysis of load balancing in distributed storage, deriving necessary redundancy levels and comparing different data assignment schemes.

Findings

01

The replication factor d must be at least Ω(log(n)^{1/3}) in the number of nodes n for load balance under any storage design.

02

For clustering and cyclic designs, d = Ω(log n) is both necessary and sufficient for load balance.

03

For block and random designs, d = Ω(log n) is sufficient for load balance, and these designs may achieve it with lower redundancy.

Abstract

Distributed systems store data objects redundantly to balance the data access load over multiple nodes. Load balancing performance depends mainly on 1) the level of storage redundancy and 2) the assignment of data objects to storage nodes. We analyze the performance implications of these design choices by considering four practical storage schemes that we refer to as clustering, cyclic, block and random design. We formulate the problem of load balancing as maintaining the load on any node below a given threshold. Regarding the level of redundancy, we find that the desired load balance can be achieved in a system of $n$ nodes only if the replication factor $d = \Omega(\log(n)^{1/3})$, which is a necessary condition for any storage design. For clustering and cyclic designs, $d = \Omega(\log(n))$ is necessary and sufficient. For block and random designs, $d = \Omega(\log(n))$ is sufficient…
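The threshold formulation in the abstract can be illustrated with a small simulation. The sketch below is not the paper's model: it assumes n objects on n nodes, a cyclic layout in which object i lives on nodes i, ..., i+d-1 (mod n), a random layout with d distinct uniformly chosen nodes per object, and a greedy rule that sends each object's whole demand to its currently least-loaded replica. All function names (`cyclic_design`, `random_design`, `max_load`, `is_balanced`) are hypothetical and chosen for illustration only.

```python
import random


def cyclic_design(n, d):
    """Assumed cyclic layout: object i is stored on nodes i, i+1, ..., i+d-1 (mod n)."""
    return [[(i + j) % n for j in range(d)] for i in range(n)]


def random_design(n, d, rng):
    """Assumed random layout: each object is stored on d distinct nodes chosen uniformly."""
    return [rng.sample(range(n), d) for _ in range(n)]


def max_load(placement, demands, n):
    """Greedy stand-in for load assignment (an assumption, not the paper's method):
    each object's whole demand goes to its currently least-loaded replica."""
    load = [0.0] * n
    for nodes, demand in zip(placement, demands):
        target = min(nodes, key=lambda v: load[v])
        load[target] += demand
    return max(load)


def is_balanced(placement, demands, n, threshold):
    """Load balance in the abstract's sense: no node's load exceeds the threshold."""
    return max_load(placement, demands, n) <= threshold


if __name__ == "__main__":
    rng = random.Random(0)
    n = 1000
    # Hypothetical demand model: independent exponential demands with mean 1.
    demands = [rng.expovariate(1.0) for _ in range(n)]
    for d in (1, 2, 4, 8):
        cyc = max_load(cyclic_design(n, d), demands, n)
        rnd = max_load(random_design(n, d, rng), demands, n)
        print(f"d={d}: max load cyclic={cyc:.2f}, random={rnd:.2f}")
```

Because the greedy rule never splits an object's demand across replicas, it only gives an upper bound on the maximum load that an optimal assignment could reach for the same placement.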

Taxonomy

Topics: Distributed systems and fault tolerance · Distributed and Parallel Computing Systems · Advanced Database Systems and Queries