Nonparametric Bayesian Optimization for General Rewards

Zishi Zhang; Tao Ren; Yijie Peng

arXiv:2602.07411·cs.LG·February 10, 2026

Nonparametric Bayesian Optimization for General Rewards

Zishi Zhang, Tao Ren, Yijie Peng

PDF

Open Access

TL;DR

This paper introduces a novel nonparametric Bayesian optimization algorithm using an infinite Gaussian process, providing no-regret guarantees for general reward functions and demonstrating superior empirical performance in complex reward scenarios.

Contribution

It presents the first BO algorithm with no-regret guarantees for broad reward models using a new infinite Gaussian process surrogate, extending Bayesian optimization capabilities.

Findings

01

Achieves no-regret guarantee in general reward settings

02

Handles non-stationary and heavy-tailed rewards effectively

03

Demonstrates state-of-the-art empirical performance

Abstract

This work focuses on Bayesian optimization (BO) under reward model uncertainty. We propose the first BO algorithm that achieves no-regret guarantee in a general reward setting, requiring only Lipschitz continuity of the objective function and accommodating a broad class of measurement noise. The core of our approach is a novel surrogate model, termed as infinite Gaussian process ( $\infty$ -GP). It is a Bayesian nonparametric model that places a prior on the space of reward distributions, enabling it to represent a substantially broader class of reward models than classical Gaussian process (GP). The $\infty$ -GP is used in combination with Thompson Sampling (TS) to enable effective exploration and exploitation. Correspondingly, we develop a new TS regret analysis framework for general rewards, which relates the regret to the total variation distance between the surrogate model and the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Bandit Algorithms Research · Gaussian Processes and Bayesian Inference · Advanced Multi-Objective Optimization Algorithms