Lower Bounds on Regret for Noisy Gaussian Process Bandit Optimization
Jonathan Scarlett, Ilijia Bogunovic, Volkan Cevher

TL;DR
This paper establishes fundamental lower bounds on the regret in noisy Gaussian process bandit optimization, revealing the minimal achievable performance and highlighting gaps with existing algorithms.
Contribution
It provides the first algorithm-independent lower bounds on simple and cumulative regret for Gaussian process bandit optimization with smooth kernels.
Findings
Lower bounds match existing upper bounds up to logarithmic factors.
Bounds are derived for squared-exponential and Matérn kernels.
Results highlight fundamental limits of optimization performance in noisy settings.
Abstract
In this paper, we consider the problem of sequentially optimizing a black-box function based on noisy samples and bandit feedback. We assume that is smooth in the sense of having a bounded norm in some reproducing kernel Hilbert space (RKHS), yielding a commonly-considered non-Bayesian form of Gaussian process bandit optimization. We provide algorithm-independent lower bounds on the simple regret, measuring the suboptimality of a single point reported after rounds, and on the cumulative regret, measuring the sum of regrets over the chosen points. For the isotropic squared-exponential kernel in dimensions, we find that an average simple regret of requires , and the average cumulative regret is at least , thus matching existing upper bounds up…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Bandit Algorithms Research · Gaussian Processes and Bayesian Inference · Machine Learning and Algorithms
