UCB for Large-Scale Pure Exploration: Beyond Sub-Gaussianity
Zaile Li, Weiwei Fan, L. Jeff Hong

TL;DR
This paper extends the theoretical understanding of UCB algorithms for large-scale pure exploration problems beyond sub-Gaussian distributions, demonstrating their optimality in heavy-tailed and non-sub-Gaussian settings.
Contribution
It introduces a meta-UCB algorithm for non-sub-Gaussian pure exploration and proves its sample optimality under broad distributional assumptions.
Findings
Meta-UCB achieves sample optimality in non-sub-Gaussian settings.
UCB algorithms are effective for large-scale pure exploration beyond sub-Gaussian assumptions.
Numerical experiments validate theoretical results and compare behaviors.
Abstract
Selecting the best alternative from a finite set represents a broad class of pure exploration problems. Traditional approaches to pure exploration have predominantly relied on Gaussian or sub-Gaussian assumptions on the performance distributions of all alternatives, which limit their applicability to non-sub-Gaussian, especially heavy-tailed, problems. The need to move beyond sub-Gaussianity may become even more critical in large-scale problems, which tend to be especially sensitive to distributional specifications. In this paper, motivated by the widespread use of upper confidence bound (UCB) algorithms in pure exploration and beyond, we investigate their performance in large-scale, non-sub-Gaussian settings. We consider the simplest category of UCB algorithms, where the UCB value for each alternative is defined as the sample mean plus an exploration bonus that depends only on its…
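The "sample mean plus exploration bonus" rule described in the abstract can be illustrated with a minimal sketch. It is not the paper's meta-UCB algorithm. Everything specific here is an assumption made for illustration: the hypothetical names `ucb_pure_exploration` and `inverse_sqrt_bonus`, a bonus that is a function of the alternative's own sample count, a fixed total sampling budget, and a final selection by largest sample mean.

```python
import math
import random


def inverse_sqrt_bonus(n):
    # Hypothetical bonus for illustration only; the paper's bonus is not given here.
    return math.sqrt(1.0 / n)


def ucb_pure_exploration(samplers, budget, bonus=inverse_sqrt_bonus, seed=0):
    """Run a simple UCB sampling loop and return (selected index, sample counts).

    samplers: list of callables; samplers[i](rng) returns one observation of alternative i.
    budget:   total number of observations (assumed fixed in advance).
    bonus:    maps an alternative's own sample count to its exploration bonus.
    """
    k = len(samplers)
    if budget < k:
        raise ValueError("budget must allow at least one sample per alternative")
    rng = random.Random(seed)
    counts = [0] * k
    sums = [0.0] * k

    # Initialization: observe every alternative once so each sample mean exists.
    for i in range(k):
        sums[i] += samplers[i](rng)
        counts[i] += 1

    # Main loop: sample the alternative with the largest UCB value,
    # where UCB = sample mean + bonus(own sample count).
    for _ in range(budget - k):
        ucb = [sums[i] / counts[i] + bonus(counts[i]) for i in range(k)]
        i = max(range(k), key=ucb.__getitem__)
        sums[i] += samplers[i](rng)
        counts[i] += 1

    # Selection rule (an assumption of this sketch): largest sample mean.
    best = max(range(k), key=lambda i: sums[i] / counts[i])
    return best, counts


def heavy_tailed_sampler(mean):
    # Illustrative heavy-tailed noise: a centered Pareto draw with shape 3
    # (finite variance, not sub-Gaussian). The Pareto(3) mean is 1.5.
    return lambda rng: mean + (rng.paretovariate(3.0) - 1.5)
```

A call such as `ucb_pure_exploration([heavy_tailed_sampler(m) for m in (0.0, 0.2, 1.0)], budget=3000)` returns the selected index and the per-alternative sample counts. Swapping the `bonus` argument is the natural way to compare different exploration bonuses under the same sampling loop.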
Taxonomy
Topics: Advanced Bandit Algorithms Research · Optimization and Search Problems · Gaussian Processes and Bayesian Inference
