Optimal Batched Best Arm Identification
Tianyuan Jin, Yu Yang, Jing Tang, Xiaokui Xiao, Pan Xu

TL;DR
This paper introduces new batched algorithms for best arm identification that optimize sample and batch complexity, achieving asymptotic optimality with minimal batches and providing near-optimal results in finite settings.
Contribution
The paper presents the Tri-BBAI and Opt-BBAI algorithms, the first to achieve asymptotic optimality in batch setting and near-optimality in finite cases, with improved complexity guarantees.
Findings
Tri-BBAI achieves asymptotic optimal sample complexity in 3 batches.
Opt-BBAI attains near-optimal complexity in non-asymptotic settings.
The new procedures do not rely on event-conditioned complexity bounds.
Abstract
We study the batched best arm identification (BBAI) problem, where the learner's goal is to identify the best arm while switching the policy as less as possible. In particular, we aim to find the best arm with probability for some small constant while minimizing both the sample complexity (total number of arm pulls) and the batch complexity (total number of batches). We propose the three-batch best arm identification (Tri-BBAI) algorithm, which is the first batched algorithm that achieves the optimal sample complexity in the asymptotic setting (i.e., ) and runs in batches in expectation. Based on Tri-BBAI, we further propose the almost optimal batched best arm identification (Opt-BBAI) algorithm, which is the first algorithm that achieves the near-optimal sample and batch complexity in the non-asymptotic setting (i.e., is finite),…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Taxonomy
TopicsMachine Learning and Algorithms · Imbalanced Data Classification Techniques
