Open Problem: Best Arm Identification: Almost Instance-Wise Optimality and the Gap Entropy Conjecture
Lijie Chen, Jian Li

TL;DR
This paper investigates the optimal sample complexity for the best arm identification problem in stochastic bandits, proposing a conjecture that introduces the gap entropy as a fundamental instance-wise lower bound, aiming to resolve a longstanding open problem.
Contribution
The authors introduce the gap entropy as a new measure and conjecture it as the instance-wise lower bound for BEST-1-ARM, advancing understanding of optimal sample complexity.
Findings
Proposes the gap entropy as a new complexity measure.
Conjectures the gap entropy as the fundamental instance-wise lower bound.
Highlights the gap between existing upper and lower bounds for the problem.
Abstract
The best arm identification problem (BEST-1-ARM) is the most basic pure exploration problem in stochastic multi-armed bandits. The problem has a long history and attracted significant attention for the last decade. However, we do not yet have a complete understanding of the optimal sample complexity of the problem: The state-of-the-art algorithms achieve a sample complexity of ( is the difference between the largest mean and the mean), while the best known lower bound is for general instances and for the two-arm instances. We propose to study the instance-wise optimality for the BEST-1-ARM problem. Previous work has proved that it is impossible to have an instance optimal algorithm for the 2-arm…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Bandit Algorithms Research · Machine Learning and Algorithms · Optimization and Search Problems
