Best-Arm Identification in Unimodal Bandits

Riccardo Poiani; Marc Jourdan; Emilie Kaufmann; Rémy Degenne

arXiv:2411.01898·cs.LG·May 27, 2025

TL;DR

This paper investigates the best-arm identification problem in unimodal bandits, deriving lower bounds and proposing algorithms that leverage the unimodal structure for improved efficiency and optimality.

Contribution

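The paper introduces modifications of Track-and-Stop and a Top Two algorithm that exploit the unimodal structure. Both Track-and-Stop variants are asymptotically optimal for one-parameter exponential families, and the Top Two algorithm is asymptotically near-optimal for Gaussian distributions, with a non-asymptotic guarantee matching the worst-case lower bound.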

Findings

1. Both versions of Track-and-Stop are asymptotically optimal for one-parameter exponential families.

2. The Top Two algorithm is asymptotically near-optimal for Gaussian distributions.

3. Empirical results show competitive performance.

Abstract

We study the fixed-confidence best-arm identification problem in unimodal bandits, in which the means of the arms increase with the index of the arm up to their maximum, then decrease. We derive two lower bounds on the stopping time of any algorithm. The instance-dependent lower bound suggests that due to the unimodal structure, only three arms contribute to the leading confidence-dependent cost. However, a worst-case lower bound shows that a linear dependence on the number of arms is unavoidable in the confidence-independent cost. We propose modifications of Track-and-Stop and a Top Two algorithm that leverage the unimodal structure. Both versions of Track-and-Stop are asymptotically optimal for one-parameter exponential families. The Top Two algorithm is asymptotically near-optimal for Gaussian distributions and we prove a non-asymptotic guarantee matching the worst-case lower bound.…
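
To make the unimodal structure described in the abstract concrete, here is a minimal Python sketch. It is an illustration under stated assumptions, not code from the paper: the helper names `is_unimodal` and `best_arm_and_neighbors` are hypothetical, the example means are made up, and strict monotonicity on each side of the peak is assumed. Reading the "three arms" in the instance-dependent bound as the best arm and its two adjacent arms is also an assumption made for this sketch.

```python
from typing import List, Sequence


def is_unimodal(means: Sequence[float]) -> bool:
    """Check that means strictly increase up to one maximum, then strictly decrease.

    Strictness is an assumption of this sketch.
    """
    if not means:
        return False
    # Index of the (first) largest mean.
    peak = max(range(len(means)), key=lambda i: means[i])
    rising = all(means[i] < means[i + 1] for i in range(peak))
    falling = all(means[i] > means[i + 1] for i in range(peak, len(means) - 1))
    return rising and falling


def best_arm_and_neighbors(means: Sequence[float]) -> List[int]:
    """Return the indices of the best arm and its adjacent arms (hypothetical helper)."""
    peak = max(range(len(means)), key=lambda i: means[i])
    # Drop neighbors that fall outside the arm range (peak at either end).
    return [i for i in (peak - 1, peak, peak + 1) if 0 <= i < len(means)]


# Illustrative instance with six arms; the values are invented for this example.
example_means = [0.1, 0.3, 0.6, 0.9, 0.5, 0.2]
print(is_unimodal(example_means))             # True
print(best_arm_and_neighbors(example_means))  # [2, 3, 4]
```

In this toy instance the best arm is index 3, so under the assumption above arms 2, 3 and 4 would be the ones driving the leading confidence-dependent cost, while the remaining arms would only enter through the confidence-independent term.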

Taxonomy

Topics: Advanced Bandit Algorithms Research