On Learning the $c\mu$ Rule in Single and Parallel Server Networks

Subhashini Krishnasamy; Ari Arapostathis; Ramesh Johari; Sanjay; Shakkottai

arXiv:1802.06723·cs.PF·July 3, 2018

On Learning the $c\mu$ Rule in Single and Parallel Server Networks

Subhashini Krishnasamy, Ari Arapostathis, Ramesh Johari, Sanjay, Shakkottai

PDF

Open Access

TL;DR

This paper studies learning-based scheduling in multi-class queueing systems, showing that simple greedy algorithms can achieve constant regret in single server settings and proposing conditions for stability and regret bounds in parallel server systems.

Contribution

It introduces a greedy learning algorithm with constant regret for single server queues and provides stability conditions and an exploration strategy for parallel server queues.

Findings

01

Greedy algorithm achieves constant regret in single server queues.

02

Parallel server queues can be stabilized under certain conditions.

03

Proposed exploration strategy ensures constant regret in parallel queues.

Abstract

We consider learning-based variants of the $c μ$ rule for scheduling in single and parallel server settings of multi-class queueing systems. In the single server setting, the $c μ$ rule is known to minimize the expected holding-cost (weighted queue-lengths summed over classes and a fixed time horizon). We focus on the problem where the service rates $μ$ are unknown with the holding-cost regret (regret against the $c μ$ rule with known $μ$ ) as our objective. We show that the greedy algorithm that uses empirically learned service rates results in a constant holding-cost regret (the regret is independent of the time horizon). This free exploration can be explained in the single server setting by the fact that any work-conserving policy obtains the same number of samples in a busy cycle. In the parallel server setting, we show that the $c μ$ rule may result in unstable…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Bandit Algorithms Research · Age of Information Optimization · Stochastic Gradient Optimization Techniques