Open Problem: Best Arm Identification: Almost Instance-Wise Optimality and the Gap Entropy Conjecture

Lijie Chen; Jian Li

arXiv:1605.08481·cs.LG·May 30, 2016

Open Access

TL;DR

This paper studies the optimal sample complexity of the best arm identification problem in stochastic multi-armed bandits. It introduces the gap entropy and conjectures that this quantity gives the fundamental instance-wise lower bound, with the aim of closing the gap between the known upper and lower bounds.

Contribution

The authors introduce the gap entropy, a new complexity measure, and conjecture that it gives the instance-wise lower bound for BEST-1-ARM, which would sharpen the understanding of the problem's optimal sample complexity.

Findings

01 Proposes the gap entropy as a new complexity measure.

02 Conjectures the gap entropy as the fundamental instance-wise lower bound.

03 Highlights the gap between existing upper and lower bounds for the problem.

Abstract

The best arm identification problem (BEST-1-ARM) is the most basic pure exploration problem in stochastic multi-armed bandits. The problem has a long history and has attracted significant attention over the last decade. However, we do not yet have a complete understanding of the optimal sample complexity of the problem: the state-of-the-art algorithms achieve a sample complexity of $O\left(\sum_{i=2}^{n} \Delta_i^{-2} \left(\ln \delta^{-1} + \ln\ln \Delta_i^{-1}\right)\right)$ (where $\Delta_i$ is the difference between the largest mean and the $i^{\text{th}}$ mean), while the best known lower bound is $\Omega\left(\sum_{i=2}^{n} \Delta_i^{-2} \ln \delta^{-1}\right)$ for general instances and $\Omega\left(\Delta^{-2} \ln\ln \Delta^{-1}\right)$ for two-arm instances. We propose to study the instance-wise optimality for the BEST-1-ARM problem. Previous work has proved that it is impossible to have an instance optimal algorithm for the 2-arm…
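To make the two bounds in the abstract concrete, the following is a minimal sketch that evaluates both expressions for a small instance, with all hidden constants dropped. The function names (`gaps`, `upper_bound_term`, `lower_bound_term`) and the example means are hypothetical and chosen only for illustration. Two assumptions are made that the abstract does not state: the best arm is unique, and $\ln\ln \Delta_i^{-1}$ is clamped to 0 whenever $\Delta_i \ge 1/e$, where the double logarithm is not positive. Here $\delta$ is taken to be the allowed failure probability, as is standard in this setting.

```python
import math


def gaps(means):
    """Return the gaps Delta_i = (largest mean) - (i-th largest mean), i = 2..n."""
    ordered = sorted(means, reverse=True)
    result = [ordered[0] - m for m in ordered[1:]]
    if any(g <= 0 for g in result):
        # Assumption: the best arm is unique, so every gap is strictly positive.
        raise ValueError("the best arm must be unique")
    return result


def lower_bound_term(means, delta):
    """Sum of Delta_i^-2 * ln(1/delta): the general lower bound, constants dropped."""
    return sum(g ** -2 * math.log(1.0 / delta) for g in gaps(means))


def upper_bound_term(means, delta):
    """Sum of Delta_i^-2 * (ln(1/delta) + ln ln(1/Delta_i)), constants dropped."""
    total = 0.0
    for g in gaps(means):
        # Assumption: clamp ln ln(1/Delta_i) to 0 when Delta_i >= 1/e,
        # where the double logarithm would be zero, negative, or undefined.
        loglog = math.log(math.log(1.0 / g)) if g < 1.0 / math.e else 0.0
        total += g ** -2 * (math.log(1.0 / delta) + loglog)
    return total


if __name__ == "__main__":
    example_means = [0.9, 0.8, 0.5]  # hypothetical instance
    delta = 0.05
    print("lower-bound expression:", lower_bound_term(example_means, delta))
    print("upper-bound expression:", upper_bound_term(example_means, delta))
```

The difference between the two printed values comes entirely from the $\ln\ln \Delta_i^{-1}$ terms, which is the part of the sample complexity that the known general lower bound does not account for.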

Taxonomy

Topics: Advanced Bandit Algorithms Research · Machine Learning and Algorithms · Optimization and Search Problems