Quantum Algorithms for Bandits with Knapsacks with Improved Regret and Time Complexities
Yuexin Su, Ziyi Yang, Peiyuan Huang, Tongyang Li, Yinyu Ye

TL;DR
This paper introduces quantum algorithms for Bandits with Knapsacks, achieving improved regret bounds and faster computation, thus advancing the integration of quantum computing with resource-constrained online learning models.
Contribution
It develops the first quantum algorithms for BwK with provable regret bounds and demonstrates quadratic and polynomial speedups over classical methods.
Findings
Quantum algorithms improve classical regret bounds by a factor related to budget and LP optimal value.
Achieves quadratic improvement in problem-dependent regret parameters.
Provides polynomial speedup in time complexity for solving BwK problems.
Abstract
Bandits with knapsacks (BwK) constitute a fundamental model that combines aspects of stochastic integer programming with online learning. Classical algorithms for BwK with a time horizon achieve a problem-independent regret bound of and a problem-dependent bound of . In this paper, we initiate the study of the BwK model in the setting of quantum computing, where both reward and resource consumption can be accessed via quantum oracles. We establish both problem-independent and problem-dependent regret bounds for quantum BwK algorithms. For the problem-independent case, we demonstrate that a quantum approach can improve the classical regret bound by a factor of , where is budget constraint in BwK and denotes the optimal value of a linear programming relaxation of the BwK problem. For…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
