Quantum Bayesian Optimization
Zhongxiang Dai, Gregory Kang Ruey Lau, Arun Verma, Yao Shu, Bryan Kian, Hsiang Low, Patrick Jaillet

TL;DR
This paper introduces a quantum Bayesian optimization algorithm that achieves a regret upper bound of O(polylog T), significantly outperforming classical bounds, and demonstrates its effectiveness through simulations and real quantum computer experiments.
Contribution
The paper presents the first quantum Bayesian optimization algorithm with a regret upper bound of O(polylog T), surpassing classical lower bounds and applicable to complex non-linear problems.
Findings
Q-GP-UCB achieves regret of O(polylog T)
Simulation and quantum computer experiments confirm quantum speedup
Smaller regret than previous quantum linear UCB algorithms
Abstract
Kernelized bandits, also known as Bayesian optimization (BO), has been a prevalent method for optimizing complicated black-box reward functions. Various BO algorithms have been theoretically shown to enjoy upper bounds on their cumulative regret which are sub-linear in the number T of iterations, and a regret lower bound of Omega(sqrt(T)) has been derived which represents the unavoidable regrets for any classical BO algorithm. Recent works on quantum bandits have shown that with the aid of quantum computing, it is possible to achieve tighter regret upper bounds better than their corresponding classical lower bounds. However, these works are restricted to either multi-armed or linear bandits, and are hence not able to solve sophisticated real-world problems with non-linear reward functions. To this end, we introduce the quantum-Gaussian process-upper confidence bound (Q-GP-UCB)…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Taxonomy
TopicsQuantum Computing Algorithms and Architecture · Advanced Bandit Algorithms Research · Quantum Information and Cryptography
