Optimal Best Arm Identification with Fixed Confidence

Aur\'elien Garivier (IMT); Emilie Kaufmann (CRIStAL; SEQUEL)

arXiv:1602.04589·math.ST·June 2, 2016·101 cites

Optimal Best Arm Identification with Fixed Confidence

Aur\'elien Garivier (IMT), Emilie Kaufmann (CRIStAL, SEQUEL)

PDF

Open Access

TL;DR

This paper characterizes the complexity of best-arm identification in one-parameter bandit problems, introduces a tight lower bound on sample complexity, and proposes an asymptotically optimal 'Track-and-Stop' strategy with a novel sampling rule and analysis.

Contribution

It provides a complete characterization of the problem's complexity, a new tight lower bound, and an optimal strategy with proven asymptotic optimality.

Findings

01

Established a tight lower bound on sample complexity.

02

Proposed the 'Track-and-Stop' strategy with a novel sampling rule.

03

Proved asymptotic optimality of the proposed method.

Abstract

We give a complete characterization of the complexity of best-arm identification in one-parameter bandit problems. We prove a new, tight lower bound on the sample complexity. We propose the `Track-and-Stop' strategy, which we prove to be asymptotically optimal. It consists in a new sampling rule (which tracks the optimal proportions of arm draws highlighted by the lower bound) and in a stopping rule named after Chernoff, for which we give a new analysis.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Bandit Algorithms Research · Machine Learning and Algorithms · Optimization and Search Problems