Minimax Optimal Fixed-Budget Best Arm Identification in Linear Bandits
Junwen Yang, Vincent Y. F. Tan

TL;DR
This paper introduces a new, theoretically optimal algorithm for fixed-budget best arm identification in linear bandits, leveraging G-optimal design, with proven minimax optimality and empirical improvements.
Contribution
The paper proposes OD-LinBAI, a parameter-free, minimax optimal algorithm for linear bandit best arm identification, with analysis based on top-arm gaps and effective dimension.
Findings
OD-LinBAI is minimax optimal up to constants.
The performance depends only on the top $d$ arm gaps.
Empirical results outperform existing methods.
Abstract
We study the problem of best arm identification in linear bandits in the fixed-budget setting. By leveraging properties of the G-optimal design and incorporating it into the arm allocation rule, we design a parameter-free algorithm, Optimal Design-based Linear Best Arm Identification (OD-LinBAI). We provide a theoretical analysis of the failure probability of OD-LinBAI. Instead of all the optimality gaps, the performance of OD-LinBAI depends only on the gaps of the top arms, where is the effective dimension of the linear bandit instance. Complementarily, we present a minimax lower bound for this problem. The upper and lower bounds show that OD-LinBAI is minimax optimal up to constant multiplicative factors in the exponent, which is a significant theoretical improvement over existing methods (e.g., BayesGap, Peace, LinearExploration and GSE), and settles the question of…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Taxonomy
TopicsAdvanced Bandit Algorithms Research · Machine Learning and Algorithms · Machine Learning and Data Classification
