Universality of Power-of-$d$ Load Balancing in Many-Server Systems
Debankur Mukherjee, Sem C. Borst, Johan S. H. van Leeuwaarden, and, Philip A. Whiting

TL;DR
This paper demonstrates that the power-of-$d$ load balancing scheme in many-server systems exhibits a universal behavior similar to the Join-the-Shortest Queue policy, maintaining optimality at fluid and diffusion scales even with reduced overhead.
Contribution
It proves the universality of the JSQ($d(N)$) scheme's fluid and diffusion limits, showing they match the JSQ policy under broad growth conditions of $d(N)$.
Findings
Fluid limit matches JSQ policy when $d(N) o \infty$
Diffusion limit coincides with JSQ under specified growth conditions
Reduces overhead while preserving optimality at key scales
Abstract
We consider a system of parallel single-server queues with unit exponential service rates and a single dispatcher where tasks arrive as a Poisson process of rate . When a task arrives, the dispatcher assigns it to a server with the shortest queue among randomly selected servers (). This load balancing strategy is referred to as a JSQ() scheme, marking that it subsumes the celebrated Join-the-Shortest Queue (JSQ) policy as a crucial special case for . We construct a stochastic coupling to bound the difference in the queue length processes between the JSQ policy and a scheme with an arbitrary value of . We use the coupling to derive the fluid limit in the regime where as with , along with the associated fixed point. The fluid limit turns out not to depend on…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
