Frequentist Regret Analysis of Gaussian Process Thompson Sampling via Fractional Posteriors
Somjit Roy, Prateek Jaiswal, Anirban Bhattacharya, Debdeep Pati, Bani K. Mallick

TL;DR
This paper provides a unified, discretization-free frequentist regret analysis for Gaussian Process Thompson Sampling in continuous action spaces, covering various kernels and revealing connections to fractional posteriors.
Contribution
It introduces a novel fractional posterior perspective for GP-TS, deriving kernel-agnostic regret bounds without domain discretization, applicable to multiple kernel types.
Findings
Achieves $ ilde{O}(T^{1/2})$ regret for squared exponential kernel.
Derives regret bounds for Matérn-$ u$ and rational quadratic kernels.
Provides a unified framework applicable across different Gaussian process kernels.
Abstract
We study Gaussian Process Thompson Sampling (GP-TS) for sequential decision-making over compact, continuous action spaces and provide a frequentist regret analysis based on fractional Gaussian process posteriors, without relying on domain discretization as in prior work. We show that the variance inflation commonly assumed in existing analyses of GP-TS can be interpreted as Thompson Sampling with respect to a fractional posterior with tempering parameter . We derive a kernel-agnostic regret bound expressed in terms of the information gain parameter and the posterior contraction rate , and identify conditions on the Gaussian process prior under which can be controlled. As special cases of our general bound, we recover regret of order for the squared exponential kernel,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGaussian Processes and Bayesian Inference · Advanced Bandit Algorithms Research · Statistical Mechanics and Entropy
