Optimal $\delta$-Correct Best-Arm Selection for Heavy-Tailed Distributions
Shubhada Agrawal, Sandeep Juneja, Peter Glynn

TL;DR
This paper develops a $\, ext{delta}$-correct algorithm for best-arm identification in heavy-tailed distributions, matching lower bounds under mild moment restrictions and introducing batch processing for efficiency.
Contribution
It introduces a $\, ext{delta}$-correct algorithm that achieves optimal sample complexity for heavy-tailed distributions with bounded moments, extending previous results beyond exponential families.
Findings
The algorithm matches the asymptotic lower bound as $\, ext{delta}$ approaches zero.
Batch processing significantly speeds up the algorithm.
A mild moment condition suffices for optimality in heavy-tailed settings.
Abstract
Given a finite set of unknown distributions or arms that can be sampled, we consider the problem of identifying the one with the maximum mean using a -correct algorithm (an adaptive, sequential algorithm that restricts the probability of error to a specified ) that has minimum sample complexity. Lower bounds for -correct algorithms are well known. -correct algorithms that match the lower bound asymptotically as reduces to zero have been previously developed when arm distributions are restricted to a single parameter exponential family. In this paper, we first observe a negative result that some restrictions are essential, as otherwise, under a -correct algorithm, distributions with unbounded support would require an infinite number of samples in expectation. We then propose a -correct algorithm that matches the lower bound as…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning and Algorithms · Face and Expression Recognition · Statistical Methods and Inference
MethodsSPEED: Separable Pyramidal Pooling EncodEr-Decoder for Real-Time Monocular Depth Estimation on Low-Resource Settings
