Best-Arm Identification in Unimodal Bandits
Riccardo Poiani, Marc Jourdan, Emilie Kaufmann, R\'emy Degenne

TL;DR
This paper investigates the best-arm identification problem in unimodal bandits, deriving lower bounds and proposing algorithms that leverage the unimodal structure for improved efficiency and optimality.
Contribution
It introduces modified algorithms based on Track-and-Stop and Top Two that exploit unimodality, achieving asymptotic optimality and near-optimality with practical efficiency.
Findings
Algorithms are asymptotically optimal for exponential families.
Top Two algorithm is near-optimal for Gaussian distributions.
Empirical results show competitive performance.
Abstract
We study the fixed-confidence best-arm identification problem in unimodal bandits, in which the means of the arms increase with the index of the arm up to their maximum, then decrease. We derive two lower bounds on the stopping time of any algorithm. The instance-dependent lower bound suggests that due to the unimodal structure, only three arms contribute to the leading confidence-dependent cost. However, a worst-case lower bound shows that a linear dependence on the number of arms is unavoidable in the confidence-independent cost. We propose modifications of Track-and-Stop and a Top Two algorithm that leverage the unimodal structure. Both versions of Track-and-Stop are asymptotically optimal for one-parameter exponential families. The Top Two algorithm is asymptotically near-optimal for Gaussian distributions and we prove a non-asymptotic guarantee matching the worse-case lower bound.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Bandit Algorithms Research
