Improved Regret Bounds for Gaussian Process Upper Confidence Bound in Bayesian Optimization

Shogo Iwazaki

arXiv:2506.01393 · cs.LG · December 12, 2025

TL;DR

This paper tightens the theoretical regret bounds for the GP-UCB algorithm in Bayesian optimization, giving improved high-probability guarantees under Matérn and squared exponential kernels.

Contribution

It provides tighter regret bounds for GP-UCB under Matérn and squared exponential kernels, filling the gap between the existing regret upper bound for GP-UCB and the best-known bound of Scarlett (2018).

Findings

01

Achieves $\tilde{O}(\sqrt{T})$ cumulative regret with high probability for Matérn kernels with a certain degree of smoothness.

02

Achieves $O(\sqrt{T \ln^{2} T})$ regret for squared exponential kernels.

03

Refines analysis of GP-UCB's concentration behavior and information gain.
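
For readers unfamiliar with the quantities in these findings, the standard definitions can be written as follows. The notation ($R_T$, $x^\ast$, $\mu_{t-1}$, $\sigma_{t-1}$, $\beta_t$) follows the common GP-UCB convention and is an assumption here; it is not taken from this page, and the paper's own notation may differ.

$$
R_T = \sum_{t=1}^{T} \bigl( f(x^\ast) - f(x_t) \bigr), \qquad x^\ast \in \arg\max_{x} f(x)
$$

$$
x_t \in \arg\max_{x} \; \mu_{t-1}(x) + \beta_t^{1/2} \, \sigma_{t-1}(x)
$$

Here $R_T$ is the cumulative regret after $T$ rounds, and the second line is the GP-UCB selection rule, where $\mu_{t-1}$ and $\sigma_{t-1}$ are the GP posterior mean and standard deviation given the first $t-1$ observations and $\beta_t$ is a confidence-width parameter.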

Abstract

This paper addresses the Bayesian optimization problem (also referred to as the Bayesian setting of the Gaussian process bandit), where the learner seeks to minimize the regret under a function drawn from a known Gaussian process (GP). Under a Matérn kernel with a certain degree of smoothness, we show that the Gaussian process upper confidence bound (GP-UCB) algorithm achieves $\tilde{O}(\sqrt{T})$ cumulative regret with high probability. Furthermore, our analysis yields $O(\sqrt{T \ln^{2} T})$ regret under a squared exponential kernel. These results fill the gap between the existing regret upper bound for GP-UCB and the best-known bound provided by Scarlett (2018). The key idea in our proof is to capture the concentration behavior of the input sequence realized by GP-UCB, enabling a more refined analysis of the GP's information gain.
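
To make the setting concrete, here is a minimal sketch of GP-UCB in the Bayesian setting described in the abstract: the objective is sampled from a known GP prior, and the learner queries the maximizer of the upper confidence bound each round. Everything specific in it is an assumption for illustration and does not come from the paper: the finite 1-D grid domain, the squared exponential lengthscale, the noise level, the constant `beta` (the theory uses a round-dependent confidence parameter), and the helper names `se_kernel`, `sample_gp_prior`, and `gp_ucb`.

```python
import numpy as np

def se_kernel(a, b, lengthscale=0.2):
    """Squared exponential kernel matrix between 1-D point sets a and b."""
    d = a[:, None] - b[None, :]
    return np.exp(-0.5 * (d / lengthscale) ** 2)

def sample_gp_prior(grid, rng, jitter=1e-6):
    """Draw one function from the zero-mean GP prior, evaluated on the grid."""
    K = se_kernel(grid, grid) + jitter * np.eye(len(grid))
    return np.linalg.cholesky(K) @ rng.standard_normal(len(grid))

def gp_ucb(f_vals, grid, T=30, noise_std=0.1, beta=4.0, seed=0):
    """Run GP-UCB on a finite grid.

    f_vals holds the (hidden) objective values on the grid. Returns the
    queried inputs and the cumulative regret after each round.
    """
    rng = np.random.default_rng(seed)
    f_max = f_vals.max()
    idxs, ys, regret = [], [], []
    cum = 0.0
    for _ in range(T):
        if idxs:
            X = grid[idxs]
            K = se_kernel(X, X) + noise_std**2 * np.eye(len(X))
            k_star = se_kernel(grid, X)
            # GP posterior mean and variance at every grid point.
            mu = k_star @ np.linalg.solve(K, np.array(ys))
            var = 1.0 - np.sum(k_star * np.linalg.solve(K, k_star.T).T, axis=1)
            sigma = np.sqrt(np.clip(var, 0.0, None))
        else:
            # No data yet: fall back to the prior (zero mean, unit variance).
            mu = np.zeros(len(grid))
            sigma = np.ones(len(grid))
        # Query the maximizer of the upper confidence bound.
        i = int(np.argmax(mu + np.sqrt(beta) * sigma))
        ys.append(f_vals[i] + noise_std * rng.normal())
        idxs.append(i)
        cum += f_max - f_vals[i]  # instantaneous regret of this query
        regret.append(cum)
    return grid[idxs], np.array(regret)
```

A typical call would be `xs, regret = gp_ucb(sample_gp_prior(grid, rng), grid)` with `grid = np.linspace(0.0, 1.0, 50)` and `rng = np.random.default_rng(1)`. The regret here is measured against the best grid point, so this toy run only illustrates the algorithm and does not reproduce the paper's bounds.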

Taxonomy

Topics: Gaussian Processes and Bayesian Inference · Machine Learning and Data Classification · Advanced Bandit Algorithms Research