On the Sublinear Regret of GP-UCB

Justin Whitehouse; Zhiwei Steven Wu; Aaditya Ramdas

arXiv:2307.07539·cs.LG·August 15, 2023

TL;DR

This paper proves that the GP-UCB algorithm achieves nearly optimal sublinear regret in kernelized bandit problems, including for commonly used kernels like Matérn, by introducing a new analysis technique based on regularizing kernel ridge estimators.

Contribution

The paper gives nearly optimal regret bounds for GP-UCB, yielding sublinear regret for the Matérn kernel and resolving a longstanding open question about whether existing regret analyses of GP-UCB are tight.

Findings

01. GP-UCB achieves nearly optimal sublinear regret.

02. Improved regret bounds for Matérn kernels.

03. New analysis technique using regularized kernel ridge estimators.

Abstract

In the kernelized bandit problem, a learner aims to sequentially compute the optimum of a function lying in a reproducing kernel Hilbert space given only noisy evaluations at sequentially chosen points. In particular, the learner aims to minimize regret, which is a measure of the suboptimality of the choices made. Arguably the most popular algorithm is the Gaussian Process Upper Confidence Bound (GP-UCB) algorithm, which involves acting based on a simple linear estimator of the unknown function. Despite its popularity, existing analyses of GP-UCB give a suboptimal regret rate, which fails to be sublinear for many commonly used kernels such as the Matérn kernel. This has led to a longstanding open question: are existing regret analyses for GP-UCB tight, or can bounds be improved by using more sophisticated analytical techniques? In this work, we resolve this open question and show that…
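
To make the setting in the abstract concrete, here is a minimal sketch of a GP-UCB loop built on a regularized kernel ridge estimator. It is an illustration under stated assumptions, not the paper's method: the squared-exponential kernel, the finite candidate grid, the fixed regularizer `reg`, the constant confidence width `beta`, and the names `rbf_kernel` and `gp_ucb` are all hypothetical choices made for this example. The paper's specific regularization and confidence bounds are not reproduced here.

```python
import numpy as np

def rbf_kernel(a, b, lengthscale=0.2):
    """Squared-exponential kernel between 1-D point arrays a and b (assumed kernel)."""
    d = a[:, None] - b[None, :]
    return np.exp(-0.5 * (d / lengthscale) ** 2)

def gp_ucb(f, candidates, horizon, reg=1.0, beta=2.0, noise_std=0.1, seed=0):
    """Run a GP-UCB loop over a finite candidate grid and return cumulative regret.

    reg and beta are illustrative constants, not values taken from the paper.
    """
    rng = np.random.default_rng(seed)
    f_vals = f(candidates)          # true function values, used only for regret
    best = f_vals.max()
    xs, ys, regret = [], [], []
    total = 0.0
    for _ in range(horizon):
        if not xs:
            # No observations yet: choose a starting point at random.
            idx = int(rng.integers(len(candidates)))
        else:
            X, y = np.array(xs), np.array(ys)
            K = rbf_kernel(X, X) + reg * np.eye(len(X))
            k_star = rbf_kernel(candidates, X)        # shape (n_candidates, t)
            # Kernel ridge mean: k_*^T (K + reg I)^{-1} y (linear in y).
            mean = k_star @ np.linalg.solve(K, y)
            # Posterior variance: k(x, x) - k_*^T (K + reg I)^{-1} k_*, with k(x, x) = 1.
            v = np.linalg.solve(K, k_star.T)          # shape (t, n_candidates)
            var = 1.0 - np.sum(k_star * v.T, axis=1)
            std = np.sqrt(np.clip(var, 0.0, None))
            # Optimism: play the point with the largest upper confidence bound.
            idx = int(np.argmax(mean + beta * std))
        xs.append(candidates[idx])
        ys.append(f_vals[idx] + noise_std * rng.normal())   # noisy evaluation
        total += best - f_vals[idx]                         # instantaneous regret
        regret.append(total)
    return np.array(regret)

# Example run on a toy objective (hypothetical).
grid = np.linspace(0.0, 1.0, 50)
cumulative = gp_ucb(lambda x: np.sin(3 * x), grid, horizon=30)
```

Sublinear regret means that `cumulative[-1] / horizon` tends to zero as the horizon grows. Here regret is measured against the best point on the grid rather than the continuous optimum, which is a simplification.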

Taxonomy

Topics: Advanced Bandit Algorithms Research · Gaussian Processes and Bayesian Inference · Distributed Sensor Networks and Detection Algorithms

Methods: Gaussian Process