TL;DR
This paper introduces three simple Bayesian algorithms for efficiently identifying the best option among many, with proven optimality and exponential convergence in measurement effort.
Contribution
It proposes and analyzes three intuitive Bayesian algorithms for best arm identification, establishing their sharp optimality properties.
Findings
Algorithms achieve exponential convergence to correct identification.
Proven optimality bounds for the proposed algorithms.
Effective adaptive measurement allocation in noisy settings.
Abstract
This paper considers the optimal adaptive allocation of measurement effort for identifying the best among a finite set of options or designs. An experimenter sequentially chooses designs to measure and observes noisy signals of their quality with the goal of confidently identifying the best design after a small number of measurements. This paper proposes three simple and intuitive Bayesian algorithms for adaptively allocating measurement effort, and formalizes a sense in which these seemingly naive rules are the best possible. One proposal is top-two probability sampling, which computes the two designs with the highest posterior probability of being optimal, and then randomizes to select among these two. One is a variant of top-two sampling which considers not only the probability a design is optimal, but the expected amount by which its quality exceeds that of other designs. The final…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
