Optimal Multi-Fidelity Best-Arm Identification
Riccardo Poiani, R\'emy Degenne, Emilie Kaufmann, Alberto Maria Metelli, Marcello Restelli

TL;DR
This paper establishes a tight lower bound and proposes an asymptotically optimal algorithm for multi-fidelity best-arm identification, improving efficiency in identifying the best option with varying sampling costs.
Contribution
It introduces a new instance-dependent lower bound and a gradient-based algorithm that achieves asymptotic optimality in multi-fidelity bandit problems.
Findings
The new algorithm outperforms existing methods in experiments.
Theoretical analysis confirms asymptotic optimality.
Insights into optimal fidelity levels for each arm.
Abstract
In bandit best-arm identification, an algorithm is tasked with finding the arm with highest mean reward with a specified accuracy as fast as possible. We study multi-fidelity best-arm identification, in which the algorithm can choose to sample an arm at a lower fidelity (less accurate mean estimate) for a lower cost. Several methods have been proposed for tackling this problem, but their optimality remain elusive, notably due to loose lower bounds on the total cost needed to identify the best arm. Our first contribution is a tight, instance-dependent lower bound on the cost complexity. The study of the optimization problem featured in the lower bound provides new insights to devise computationally efficient algorithms, and leads us to propose a gradient-based approach with asymptotically optimal cost complexity. We demonstrate the benefits of the new algorithm compared to existing…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Taxonomy
TopicsAdvanced Bandit Algorithms Research · Machine Learning and Algorithms · Adversarial Robustness in Machine Learning
