Pure Exploration for Constrained Best Mixed Arm Identification with a Fixed Budget
Dengwang Tang, Rahul Jain, Ashutosh Nayyar, Pierluigi Nuzzo

TL;DR
This paper introduces the CBMAI problem in stochastic bandits, proposing a novel algorithm with theoretical guarantees for identifying optimal mixed arms under cost constraints within a fixed budget.
Contribution
It presents a new constrained best mixed arm identification problem, a parameter-free algorithm, and theoretical bounds on mis-identification probability.
Findings
Mis-identification probability decays exponentially with budget
The proposed algorithm effectively identifies optimal mixed arms
Theoretical bounds match empirical results
Abstract
In this paper, we introduce the constrained best mixed arm identification (CBMAI) problem with a fixed budget. This is a pure exploration problem in a stochastic finite armed bandit model. Each arm is associated with a reward and multiple types of costs from unknown distributions. Unlike the unconstrained best arm identification problem, the optimal solution for the CBMAI problem may be a randomized mixture of multiple arms. The goal thus is to find the best mixed arm that maximizes the expected reward subject to constraints on the expected costs with a given learning budget . We propose a novel, parameter-free algorithm, called the Score Function-based Successive Reject (SFSR) algorithm, that combines the classical successive reject framework with a novel score-function-based rejection criteria based on linear programming theory to identify the optimal support. We provide a…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsManufacturing Process and Optimization · Advanced Measurement and Metrology Techniques
