Lower Bounds on Regret for Noisy Gaussian Process Bandit Optimization

Jonathan Scarlett; Ilijia Bogunovic; Volkan Cevher

arXiv:1706.00090·stat.ML·June 1, 2018·30 cites

Lower Bounds on Regret for Noisy Gaussian Process Bandit Optimization

Jonathan Scarlett, Ilijia Bogunovic, Volkan Cevher

PDF

Open Access

TL;DR

This paper establishes fundamental lower bounds on the regret in noisy Gaussian process bandit optimization, revealing the minimal achievable performance and highlighting gaps with existing algorithms.

Contribution

It provides the first algorithm-independent lower bounds on simple and cumulative regret for Gaussian process bandit optimization with smooth kernels.

Findings

01

Lower bounds match existing upper bounds up to logarithmic factors.

02

Bounds are derived for squared-exponential and Matérn kernels.

03

Results highlight fundamental limits of optimization performance in noisy settings.

Abstract

In this paper, we consider the problem of sequentially optimizing a black-box function $f$ based on noisy samples and bandit feedback. We assume that $f$ is smooth in the sense of having a bounded norm in some reproducing kernel Hilbert space (RKHS), yielding a commonly-considered non-Bayesian form of Gaussian process bandit optimization. We provide algorithm-independent lower bounds on the simple regret, measuring the suboptimality of a single point reported after $T$ rounds, and on the cumulative regret, measuring the sum of regrets over the $T$ chosen points. For the isotropic squared-exponential kernel in $d$ dimensions, we find that an average simple regret of $ϵ$ requires $T = Ω (\frac{1}{ϵ ^{2}} (lo g \frac{1}{ϵ})^{d /2})$ , and the average cumulative regret is at least $Ω (T (lo g T)^{d /2})$ , thus matching existing upper bounds up…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Bandit Algorithms Research · Gaussian Processes and Bayesian Inference · Machine Learning and Algorithms