Optimal Best-Arm Identification under Fixed Confidence with Multiple Optima

Lan V. Truong

arXiv:2505.15643·cs.LG·March 5, 2026

Optimal Best-Arm Identification under Fixed Confidence with Multiple Optima

Lan V. Truong

PDF

Open Access

TL;DR

This paper establishes a new lower bound and an asymptotically optimal algorithm for best-arm identification in multi-armed bandits with multiple known optimal arms, improving efficiency in fixed-confidence settings.

Contribution

It introduces a tighter information-theoretic lower bound and modifies the Track-and-Stop algorithm to achieve asymptotic optimality when the number of optimal arms is known.

Findings

01

Derived a new lower bound on sample complexity for multi-optimal arms

02

Proposed a tie-aware stopping rule for Track-and-Stop

03

Proved asymptotic optimality of the modified algorithm

Abstract

We study best-arm identification in stochastic multi-armed bandits under the fixed-confidence setting, focusing on instances with multiple optimal arms. Unlike prior work that addresses the unknown-number-of-optimal-arms case, we consider the setting where the number of optimal arms is known in advance. We derive a new information-theoretic lower bound on the expected sample complexity that leverages this structural knowledge and is strictly tighter than previous bounds. Building on the Track-and-Stop algorithm, we propose a modified, tie-aware stopping rule and prove that it achieves asymptotic instance-optimality, matching the new lower bound. Our results provide the first formal guarantee of optimality for Track-and-Stop in multi-optimal settings with known cardinality, offering both theoretical insights and practical guidance for efficiently identifying any optimal arm.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTarget Tracking and Data Fusion in Sensor Networks · Distributed Sensor Networks and Detection Algorithms · Guidance and Control Systems

MethodsFocus · Sparse Evolutionary Training