Optimal Load Balancing in Bipartite Graphs
Wentao Weng, Xingyu Zhou, R. Srikant

TL;DR
This paper analyzes load balancing policies in bipartite graphs representing job-server constraints with heterogeneous servers, demonstrating asymptotic optimality and providing bounds for finite systems under certain connectivity conditions.
Contribution
It introduces and evaluates the asymptotic optimality of JFSQ and JFIQ policies in bipartite graphs with heterogeneous servers, under well-connectedness assumptions.
Findings
JFSQ and JFIQ are asymptotically optimal in mean response time as server count grows.
Upper bounds on response time are established for finite systems.
Random bipartite graphs can satisfy well-connectedness with sparse edges.
Abstract
Applications in cloud platforms motivate the study of efficient load balancing under job-server constraints and server heterogeneity. In this paper, we study load balancing on a bipartite graph where left nodes correspond to job types and right nodes correspond to servers, with each edge indicating that a job type can be served by a server. Thus edges represent locality constraints, i.e., each job can only be served at servers which contained certain data and/or machine learning (ML) models. Servers in this system can have heterogeneous service rates. In this setting, we investigate the performance of two policies named Join-the-Fastest-of-the-Shortest-Queue (JFSQ) and Join-the-Fastest-of-the-Idle-Queue (JFIQ), which are simple variants of Join-the-Shortest-Queue and Join-the-Idle-Queue, where ties are broken in favor of the fastest servers. Under a "well-connected" graph condition, we…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsInterconnection Networks and Systems · Real-Time Systems Scheduling · Advanced Queuing Theory Analysis
