Learning While Scheduling in Multi-Server Systems with Unknown Statistics: MaxWeight with Discounted UCB
Zixian Yang, R. Srikant, Lei Ying

TL;DR
This paper introduces a novel MaxWeight with discounted UCB algorithm for multi-server queueing systems that learns service rates and schedules jobs simultaneously, achieving near-optimal queue length bounds and significantly improved delay performance.
Contribution
The paper presents a new combined learning and scheduling algorithm that handles unknown service statistics without decoupling phases, improving delay bounds and robustness to nonstationary rates.
Findings
Asymptotic average queue length is bounded by one over traffic slackness.
Exponential tail bounds for queue length are established.
Simulation results show several orders of magnitude delay improvement.
Abstract
Multi-server queueing systems are widely used models for job scheduling in machine learning, wireless networks, crowdsourcing, and healthcare systems. This paper considers a multi-server system with multiple servers and multiple types of jobs, where different job types require different amounts of processing time at different servers. The goal is to schedule jobs on servers without knowing the statistics of the processing times. To fully utilize the processing power of the servers, it is known that one has to at least learn the service rates of different job types on different servers. Prior works on this topic decouple the learning and scheduling phases which leads to either excessive exploration or extremely large job delays. We propose a new algorithm, which combines the MaxWeight scheduling policy with discounted upper confidence bound (UCB), to simultaneously learn the statistics…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsIoT and Edge/Fog Computing
Methodstravel james
