Instance-Dependent Regret Analysis of Kernelized Bandits

Shubhanshu Shekhar; Tara Javidi

arXiv:2203.06297·cs.LG·March 15, 2022

Open Access

TL;DR

This paper analyzes the kernelized bandit problem, providing instance-dependent regret lower bounds and proposing an adaptive algorithm that performs well on specific problem instances, improving over worst-case guarantees.

Contribution

It introduces the first instance-dependent regret bounds for kernelized bandits and develops an adaptive algorithm that adjusts to easier problem instances.

Findings

01. Derives instance-dependent regret lower bounds applicable to common algorithms.

02. Proposes a near-optimal, adaptive algorithm that improves performance on simpler instances.

03. Addresses limitations of worst-case analysis by focusing on specific problem complexities.

Abstract

We study the kernelized bandit problem, which involves designing an adaptive strategy for querying a noisy zeroth-order oracle to efficiently learn about the optimizer of an unknown function $f$ with a norm bounded by $M < \infty$ in a Reproducing Kernel Hilbert Space (RKHS) associated with a positive definite kernel $K$. Prior results, working in a minimax framework, have characterized the worst-case (over all functions in the problem class) limits on regret achievable by any algorithm, and have constructed algorithms with matching (modulo polylogarithmic factors) worst-case performance for the Matérn family of kernels. These results suffer from two drawbacks. First, the minimax lower bound gives no information about the limits of regret achievable by the commonly used algorithms on specific problem instances. Second, due to their worst-case nature, the existing upper bound…
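To make the problem setup concrete, here is a minimal Python sketch of a kernelized bandit instance. It is an illustration only and does not implement the paper's algorithm or its bounds. Everything specific in it is an assumption made for the example: the Gaussian (RBF) kernel and its lengthscale, the finite kernel expansion used to build $f$, the grid on $[0, 1]$, the Gaussian noise level, the choice to maximize $f$, and the helper names `rbf_kernel`, `make_rkhs_function`, `noisy_oracle`, and `cumulative_regret`.

```python
import math
import random


def rbf_kernel(x, y, lengthscale=0.2):
    """Gaussian (RBF) kernel; an assumed example of a positive definite K."""
    return math.exp(-((x - y) ** 2) / (2.0 * lengthscale ** 2))


def make_rkhs_function(centers, weights, kernel=rbf_kernel):
    """Build f(x) = sum_i w_i K(x, z_i) and return (f, RKHS norm of f)."""
    def f(x):
        return sum(w * kernel(x, z) for w, z in zip(weights, centers))

    # For a finite kernel expansion, ||f||^2 = sum_{i,j} w_i w_j K(z_i, z_j).
    sq_norm = sum(
        wi * wj * kernel(zi, zj)
        for wi, zi in zip(weights, centers)
        for wj, zj in zip(weights, centers)
    )
    return f, math.sqrt(sq_norm)


def noisy_oracle(f, x, rng, noise_std=0.1):
    """Zeroth-order oracle: returns f(x) plus Gaussian noise (assumed model)."""
    return f(x) + rng.gauss(0.0, noise_std)


def cumulative_regret(f, queries, domain):
    """Sum over queries of f(x*) - f(x_t), with x* the maximizer on the grid."""
    f_star = max(f(x) for x in domain)
    return sum(f_star - f(x) for x in queries)


rng = random.Random(0)
domain = [i / 100 for i in range(101)]  # finite grid on [0, 1]
M = 2.0                                 # assumed bound on the RKHS norm
f, norm = make_rkhs_function(centers=[0.2, 0.7], weights=[0.5, 1.0])

# Non-adaptive baseline: query points uniformly at random.
queries = [rng.choice(domain) for _ in range(200)]
observations = [noisy_oracle(f, x, rng) for x in queries]
regret_random = cumulative_regret(f, queries, domain)

# Idealized reference: always query the grid maximizer (unknown in practice).
x_star = max(domain, key=f)
regret_oracle = cumulative_regret(f, [x_star] * 200, domain)
```

An adaptive strategy would choose each query from the earlier `observations`, aiming for cumulative regret that falls between these two references. How small that regret can be on a given instance $f$, rather than in the worst case over the norm ball of radius $M$, is the question the abstract raises.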

Taxonomy

Topics: Advanced Bandit Algorithms Research · Machine Learning and Algorithms · Stochastic Gradient Optimization Techniques