The Transient Cost of Learning in Queueing Systems
Daniel Freund, Thodoris Lykouris, Wentao Weng

TL;DR
This paper introduces the Transient Cost of Learning in Queueing (TCLQ), a metric quantifying the early-stage impact of parameter uncertainty on queue lengths, with theoretical analysis for various queueing systems.
Contribution
It proposes the TCLQ metric, characterizes its behavior in different queueing systems, and develops a unified analysis framework combining Lyapunov and bandit methods.
Findings
TCLQ provides a bound on queue length increase during learning.
The analysis applies to single and multi-queue systems.
A unified framework bridges Lyapunov and bandit techniques.
Abstract
Queueing systems are widely applicable stochastic models with use cases in communication networks, healthcare, service systems, etc. Although their optimal control has been extensively studied, most existing approaches assume perfect knowledge of the system parameters. This assumption rarely holds in practice where there is parameter uncertainty, thus motivating a recent line of work on bandit learning for queueing systems. This nascent stream of research focuses on the asymptotic performance of the proposed algorithms but does not provide insight on the transient performance in the early stages of the learning process. In this paper, we propose the Transient Cost of Learning in Queueing (TCLQ), a new metric that quantifies the maximum increase in time-averaged queue length caused by parameter uncertainty. We characterize the TCLQ of a single-queue multi-server system, and then extend…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Queuing Theory Analysis · Age of Information Optimization · Advanced Wireless Network Optimization
Methodstravel james
