Regret Bounds for Expected Improvement Algorithms in Gaussian Process Bandit Optimization

Hung Tran-The; Sunil Gupta; Santu Rana; Svetha Venkatesh

arXiv:2203.07875·cs.LG·April 28, 2026

Regret Bounds for Expected Improvement Algorithms in Gaussian Process Bandit Optimization

Hung Tran-The, Sunil Gupta, Santu Rana, Svetha Venkatesh

PDF

1 Video

TL;DR

This paper analyzes the theoretical convergence of expected improvement algorithms in Gaussian process bandit optimization, proposing variants that achieve regret bounds and faster convergence without requiring prior knowledge of certain parameters.

Contribution

The authors introduce a variant of EI with a standard incumbent and prove its convergence with regret bounds, also proposing an improved version that converges faster without needing prior parameter knowledge.

Findings

01

Proposed EI variant converges with a regret bound of O(γ_T√T).

02

Improved GP-EI algorithm achieves faster convergence than previous methods.

03

Empirical results demonstrate the effectiveness of the proposed algorithms.

Abstract

The expected improvement (EI) algorithm is one of the most popular strategies for optimization under uncertainty due to its simplicity and efficiency. Despite its popularity, the theoretical aspects of this algorithm have not been properly analyzed. In particular, whether in the noisy setting, the EI strategy with a standard incumbent converges is still an open question of the Gaussian process bandit optimization problem. We aim to answer this question by proposing a variant of EI with a standard incumbent defined via the GP predictive mean. We prove that our algorithm converges, and achieves a cumulative regret bound of $O (γ_{T} T)$ , where $γ_{T}$ is the maximum information gain between $T$ observations and the Gaussian process model. Based on this variant of EI, we further propose an algorithm called Improved GP-EI that converges faster than previous counterparts.…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

Regret Bounds for Expected Improvement Algorithms in Gaussian Process Bandit Optimization· slideslive