Nearly-Optimal Algorithm for Adversarial Kernelized Bandits

Shogo Iwazaki

arXiv:2605.10299·cs.LG·May 12, 2026

Nearly-Optimal Algorithm for Adversarial Kernelized Bandits

Shogo Iwazaki

PDF

TL;DR

This paper introduces a nearly-optimal algorithm for adversarial kernelized bandits, achieving low regret bounds and including a computationally efficient variant with Nyström approximation.

Contribution

It provides the first nearly-optimal adversarial kernelized bandit algorithm with regret guarantees and a scalable Nyström-based implementation.

Findings

01

Achieves $ ilde{O}( oot{T}{ ext{γ}_T})$ adversarial regret.

02

Provides lower bounds confirming optimality for SE and Matérn kernels.

03

Develops a Nyström approximation variant maintaining near-optimal regret.

Abstract

This paper studies kernelized bandits (also known as Gaussian process bandits) in an adversarial environment, where the reward functions in a known reproducing kernel Hilbert space (RKHS) may be adversarially chosen at each round. We show that the exponential-weight algorithm achieves $\tilde{O} (T γ_{T})$ adversarial regret, where $T$ and $γ_{T}$ denote the number of total rounds and the maximum information gain, respectively. For squared exponential (SE) and $ν$ -Mat\'ern kernels, we also show algorithm-independent lower bounds that guarantee the optimality of our algorithm up to polylogarithmic factors. Furthermore, we present a computationally efficient variant of our algorithm using Nystr\"om approximation while maintaining nearly optimal regret guarantees.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.