On Regret Bounds of Thompson Sampling for Bayesian Optimization
Shion Takeno, Shogo Iwazaki

TL;DR
This paper analyzes regret bounds for Gaussian process Thompson sampling in Bayesian optimization, providing new theoretical insights and bounds that improve understanding of its performance compared to existing methods.
Contribution
It establishes several new regret bounds for GP-TS, including lower bounds, second moment bounds, and improved cumulative regret bounds, filling gaps in prior analyses.
Findings
GP-TS has a polynomial dependence on 1/δ in regret lower bounds.
An upper bound on the second moment of cumulative regret is derived.
Improved bounds on expected lenient regret and cumulative regret are presented.
Abstract
We study a widely used Bayesian optimization method, Gaussian process Thompson sampling (GP-TS), under the assumption that the objective function is a sample path from a GP. Compared with the GP upper confidence bound (GP-UCB) with established high-probability and expected regret bounds, most analyses of GP-TS have been limited to expected regret. Moreover, whether the recent analyses of GP-UCB for the lenient regret and the improved cumulative regret upper bound can be applied to GP-TS remains unclear. To fill these gaps, this paper shows several regret bounds: (i) a regret lower bound for GP-TS, which implies that GP-TS suffers from a polynomial dependence on with probability , (ii) an upper bound of the second moment of cumulative regret, which directly suggests an improved regret upper bound on , (iii) expected lenient regret upper bounds, and (iv) an…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Bandit Algorithms Research · Gaussian Processes and Bayesian Inference · Risk and Portfolio Optimization
