On Learning the $c\mu$ Rule in Single and Parallel Server Networks
Subhashini Krishnasamy, Ari Arapostathis, Ramesh Johari, Sanjay, Shakkottai

TL;DR
This paper studies learning-based scheduling in multi-class queueing systems, showing that simple greedy algorithms can achieve constant regret in single server settings and proposing conditions for stability and regret bounds in parallel server systems.
Contribution
It introduces a greedy learning algorithm with constant regret for single server queues and provides stability conditions and an exploration strategy for parallel server queues.
Findings
Greedy algorithm achieves constant regret in single server queues.
Parallel server queues can be stabilized under certain conditions.
Proposed exploration strategy ensures constant regret in parallel queues.
Abstract
We consider learning-based variants of the rule for scheduling in single and parallel server settings of multi-class queueing systems. In the single server setting, the rule is known to minimize the expected holding-cost (weighted queue-lengths summed over classes and a fixed time horizon). We focus on the problem where the service rates are unknown with the holding-cost regret (regret against the rule with known ) as our objective. We show that the greedy algorithm that uses empirically learned service rates results in a constant holding-cost regret (the regret is independent of the time horizon). This free exploration can be explained in the single server setting by the fact that any work-conserving policy obtains the same number of samples in a busy cycle. In the parallel server setting, we show that the rule may result in unstable…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Bandit Algorithms Research · Age of Information Optimization · Stochastic Gradient Optimization Techniques
