Optimization for Gaussian Processes via Chaining

Emile Contal; C\'edric Malherbe; Nicolas Vayatis

arXiv:1510.05576·stat.ML·October 20, 2015·1 cites

Optimization for Gaussian Processes via Chaining

Emile Contal, C\'edric Malherbe, Nicolas Vayatis

PDF

Open Access

TL;DR

This paper introduces a generalized Gaussian process optimization method using localized chaining and covering numbers, achieving comparable regret bounds to GP-UCB while improving empirical efficiency across diverse input spaces.

Contribution

It extends the GP-UCB algorithm to arbitrary kernels and spaces with a novel chaining approach and a new optimization scheme based on covering numbers.

Findings

01

Theoretical regret bounds match those of GP-UCB.

02

Algorithm demonstrates improved empirical efficiency.

03

Applicable to complex and simple input spaces.

Abstract

In this paper, we consider the problem of stochastic optimization under a bandit feedback model. We generalize the GP-UCB algorithm [Srinivas and al., 2012] to arbitrary kernels and search spaces. To do so, we use a notion of localized chaining to control the supremum of a Gaussian process, and provide a novel optimization scheme based on the computation of covering numbers. The theoretical bounds we obtain on the cumulative regret are more generic and present the same convergence rates as the GP-UCB algorithm. Finally, the algorithm is shown to be empirically more efficient than its natural competitors on simple and complex input spaces.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Bandit Algorithms Research · Gaussian Processes and Bayesian Inference · Machine Learning and Algorithms