PAC Best Arm Identification Under a Deadline
Brijen Thananjeyan, Kirthevasan Kandasamy, Ion Stoica, Michael I., Jordan, Ken Goldberg, Joseph E. Gonzalez

TL;DR
This paper introduces a new algorithm, Elastic Batch Racing, for identifying an approximately optimal arm within a fixed number of decision rounds, addressing challenges in constrained, deadline-driven environments like clinical trials and cloud simulations.
Contribution
The paper formalizes the finite-deadline best arm identification problem, establishes hardness results, and proposes an optimal algorithm tailored for decision-constrained scenarios.
Findings
EBR outperforms baseline methods by several orders of magnitude.
Theoretical bounds show EBR's optimality under the problem's hardness constraints.
Finite deadlines significantly impact the adaptivity and sample complexity in best arm identification.
Abstract
We study -PAC best arm identification, where a decision-maker must identify an -optimal arm with probability at least , while minimizing the number of arm pulls (samples). Most of the work on this topic is in the sequential setting, where there is no constraint on the time taken to identify such an arm; this allows the decision-maker to pull one arm at a time. In this work, the decision-maker is given a deadline of rounds, where, on each round, it can adaptively choose which arms to pull and how many times to pull them; this distinguishes the number of decisions made (i.e., time or number of rounds) from the number of samples acquired (cost). Such situations occur in clinical trials, where one may need to identify a promising treatment under a deadline while minimizing the number of test subjects, or in simulation-based studies run on the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Bandit Algorithms Research · Statistical Methods in Clinical Trials · Machine Learning and Algorithms
