Sharper Guarantees for Misspecified Kernelized Bandit Optimization
Davide Maran, Csaba Szepesv\'ari

TL;DR
This paper improves theoretical guarantees for misspecified kernelized bandit optimization by reducing the impact of misspecification to logarithmic or polylogarithmic growth through spectral localization techniques.
Contribution
It introduces new offline and online bounds that significantly lower the misspecification penalty for a broad class of kernels, using spectral localization and domain-splitting methods.
Findings
Offline simple-regret bounds with spectral Lebesgue constant control.
Logarithmic amplification for 1D monotone spectra.
Polylogarithmic amplification for multivariate Fourier-diagonal kernels.
Abstract
Existing guarantees for misspecified kernelized bandit optimization pay for misspecification through kernel complexity: in generic offline bounds, the misspecification level is multiplied by , where is the kernel effective dimension, while in online regret bounds, the corresponding penalty is , where is the maximum information gain after rounds of interaction. In this work, we show that, for a large class of kernels, the misspecification amplification can be reduced to logarithmic or polylogarithmic growth. In the offline setting, we first prove high-probability simple-regret bounds whose misspecification term is governed by a spectral Lebesgue constant. This yields logarithmic amplification for one-dimensional monotone spectra and polylogarithmic amplification for multivariate…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
