Power-of-$d$ Choices Load Balancing in the Sub-Halfin Whitt Regime
Sushil Mahavir Varma, Francisco Castro, Siva Theja Maguluri

TL;DR
This paper analyzes the power-of-d choices load balancing algorithm in a large-scale, heavy-traffic setting, revealing how the queue length behavior depends on the growth rate of d relative to the number of servers.
Contribution
It characterizes the queue length distribution for various growth rates of d in the sub-Halfin-Whitt regime, extending previous results to smaller d values.
Findings
Queue length is approximately m when d is polynomial in n with exponent 1/m.
Queue length becomes unbounded when d grows polylogarithmically in n.
High probability bounds on queue lengths are established using iterative state space collapse.
Abstract
We consider the load balancing system under Poisson arrivals, exponential services, and homogeneous servers. Upon arrival, a job is to be routed to one of the servers, where it is queued until service. We consider the Power-of- choices routing algorithm, which chooses the queue with minimum length among randomly sampled queues. We study this system in the many-server heavy-traffic regime where the number of servers goes to infinity simultaneously when the load approaches the capacity. In particular, we consider a sequence of systems with servers and the arrival rate is given by for some , known as the sub-Halfin-Whitt regime. It was shown by [Liu Ying (2020)] that under Power-of- choices routing with , the queue length behaves similarly to that of JSQ and that there are asymptotically zero queueing…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Queuing Theory Analysis · Optimization and Search Problems · Distributed systems and fault tolerance
