Asymptotic Optimality of Speed-Aware JSQ for Heterogeneous Systems
Sanidhay Bhambay, Arpan Mukhopadhyay

TL;DR
This paper proves that a speed-aware join-the-shortest-queue scheme achieves delay optimality in large heterogeneous server systems by developing new techniques for establishing tightness and analyzing the fluid limit.
Contribution
It introduces a novel speed-aware JSQ scheme for heterogeneous systems and demonstrates its asymptotic delay optimality using innovative Lyapunov drift techniques.
Findings
Achieves delay optimality in the fluid limit for heterogeneous systems.
Develops a new technique using Lyapunov drift for tightness in steady-state distributions.
Shows the fluid limit has a unique, globally attractive fixed point.
Abstract
The Join-the-Shortest-Queue (JSQ) load-balancing scheme is known to minimise the average delay of jobs in homogeneous systems consisting of identical servers. However, it performs poorly in heterogeneous systems where servers have different processing rates. Finding a delay optimal scheme remains an open problem for heterogeneous systems. In this paper, we consider a speed-aware version of the JSQ scheme for heterogeneous systems and show that it achieves delay optimality in the fluid limit. One of the key issues in establishing this optimality result for heterogeneous systems is to show that the sequence of steady-state distributions indexed by the system size is tight in an appropriately defined space. The usual technique for showing tightness by coupling with a suitably defined dominant system does not work for heterogeneous systems. To prove tightness, we devise a new technique that…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Queuing Theory Analysis · Advanced Wireless Network Optimization · Cloud Computing and Resource Management
