A lower bound on the queueing delay in resource constrained load balancing
David Gamarnik, John N. Tsitsiklis, Martin Zubeldia

TL;DR
This paper establishes a fundamental lower bound on the resources needed for load balancing policies to achieve vanishing queueing delay in large-scale distributed systems, showing that certain resource constraints prevent delay reduction.
Contribution
It proves that symmetric dispatching policies with limited memory and message exchange cannot achieve zero queueing delay as the system scales.
Findings
Finite memory and message rate policies cannot eliminate queueing delay asymptotically.
Expected queueing delay remains bounded away from zero under certain resource constraints.
Symmetric policies with minimal resources are insufficient for delay minimization in large systems.
Abstract
We consider the following distributed service model: jobs with unit mean, general distribution, and independent processing times arrive as a renewal process of rate , with , and are immediately dispatched to one of several queues associated with identical servers with unit processing rate. We assume that the dispatching decisions are made by a central dispatcher endowed with a finite memory, and with the ability to exchange messages with the servers. We study the fundamental resource requirements (memory bits and message exchange rate), in order to drive the expected queueing delay in steady-state of a typical job to zero, as increases. We develop a novel approach to show that, within a certain broad class of "symmetric" policies, every dispatching policy with a message rate of the order of , and with a memory of the order of bits, results…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
