UCB for Large-Scale Pure Exploration: Beyond Sub-Gaussianity

Zaile Li; Weiwei Fan; L. Jeff Hong

arXiv:2511.22273·stat.ML·December 1, 2025

UCB for Large-Scale Pure Exploration: Beyond Sub-Gaussianity

Zaile Li, Weiwei Fan, L. Jeff Hong

PDF

Open Access

TL;DR

This paper extends the theoretical understanding of UCB algorithms for large-scale pure exploration problems beyond sub-Gaussian distributions, demonstrating their optimality in heavy-tailed and non-sub-Gaussian settings.

Contribution

It introduces a meta-UCB algorithm for non-sub-Gaussian pure exploration and proves its sample optimality under broad distributional assumptions.

Findings

01

Meta-UCB achieves sample optimality in non-sub-Gaussian settings.

02

UCB algorithms are effective for large-scale pure exploration beyond sub-Gaussian assumptions.

03

Numerical experiments validate theoretical results and compare behaviors.

Abstract

Selecting the best alternative from a finite set represents a broad class of pure exploration problems. Traditional approaches to pure exploration have predominantly relied on Gaussian or sub-Gaussian assumptions on the performance distributions of all alternatives, which limit their applicability to non-sub-Gaussian especially heavy-tailed problems. The need to move beyond sub-Gaussianity may become even more critical in large-scale problems, which tend to be especially sensitive to distributional specifications. In this paper, motivated by the widespread use of upper confidence bound (UCB) algorithms in pure exploration and beyond, we investigate their performance in the large-scale, non-sub-Gaussian settings. We consider the simplest category of UCB algorithms, where the UCB value for each alternative is defined as the sample mean plus an exploration bonus that depends only on its…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Bandit Algorithms Research · Optimization and Search Problems · Gaussian Processes and Bayesian Inference