An online learning approach to dynamic pricing and capacity sizing in service systems
Xinyun Chen, Yunan Liu, Guiyu Hong

TL;DR
This paper introduces GOLiQ, an online learning algorithm for dynamic pricing and capacity sizing in queueing systems, achieving near-optimal profit without requiring large-scale assumptions.
Contribution
It proposes a novel online learning framework for queue management that does not depend on heavy-traffic limits, with proven convergence and regret bounds.
Findings
Algorithm converges with logarithmic regret
Effective in various GI/GI/1 queue scenarios
Outperforms traditional static policies
Abstract
We study a dynamic pricing and capacity sizing problem in a queue, where the service provider's objective is to obtain the optimal service fee and service capacity so as to maximize the cumulative expected profit (the service revenue minus the staffing cost and delay penalty). Due to the complex nature of the queueing dynamics, such a problem has no analytic solution so that previous research often resorts to heavy-traffic analysis where both the arrival rate and service rate are sent to infinity. In this work we propose an online learning framework designed for solving this problem which does not require the system's scale to increase. Our framework is dubbed Gradient-based Online Learning in Queue (GOLiQ). GOLiQ organizes the time horizon into successive operational cycles and prescribes an efficient procedure to obtain improved pricing and staffing policies in…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Bandit Algorithms Research · Advanced Queuing Theory Analysis · Optimization and Search Problems
