Nearly-Optimal Algorithm for Adversarial Kernelized Bandits
Shogo Iwazaki

TL;DR
This paper introduces a nearly-optimal algorithm for adversarial kernelized bandits, achieving low regret bounds and including a computationally efficient variant with Nyström approximation.
Contribution
It provides the first nearly-optimal adversarial kernelized bandit algorithm with regret guarantees and a scalable Nyström-based implementation.
Findings
Achieves $ ilde{O}( oot{T}{ ext{γ}_T})$ adversarial regret.
Provides lower bounds confirming optimality for SE and Matérn kernels.
Develops a Nyström approximation variant maintaining near-optimal regret.
Abstract
This paper studies kernelized bandits (also known as Gaussian process bandits) in an adversarial environment, where the reward functions in a known reproducing kernel Hilbert space (RKHS) may be adversarially chosen at each round. We show that the exponential-weight algorithm achieves adversarial regret, where and denote the number of total rounds and the maximum information gain, respectively. For squared exponential (SE) and -Mat\'ern kernels, we also show algorithm-independent lower bounds that guarantee the optimality of our algorithm up to polylogarithmic factors. Furthermore, we present a computationally efficient variant of our algorithm using Nystr\"om approximation while maintaining nearly optimal regret guarantees.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
