The Influence of Shape Constraints on the Thresholding Bandit Problem
James Cheshire, Pierre Menard, Alexandra Carpentier

TL;DR
This paper studies how shape constraints like monotonicity, unimodality, and concavity affect the difficulty and optimal strategies for the Thresholding Bandit Problem, providing minimax regret rates for each case.
Contribution
It derives problem-independent minimax regret rates for TBP under various shape constraints and introduces algorithms tailored to each setting.
Findings
Minimax regret rates vary significantly with shape constraints.
Shape constraints fundamentally alter the TBP's complexity.
Provided algorithms achieve the derived minimax rates.
Abstract
We investigate the stochastic Thresholding Bandit problem (TBP) under several shape constraints. On top of (i) the vanilla, unstructured TBP, we consider the case where (ii) the sequence of arm's means is monotonically increasing MTBP, (iii) the case where is unimodal UTBP and (iv) the case where is concave CTBP. In the TBP problem the aim is to output, at the end of the sequential game, the set of arms whose means are above a given threshold. The regret is the highest gap between a misclassified arm and the threshold. In the fixed budget setting, we provide problem independent minimax rates for the expected regret in all settings, as well as associated algorithms. We prove that the minimax rates for the regret are (i) for TBP, (ii) for MTBP, (iii) for UTBP and (iv) for CTBP,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Bandit Algorithms Research · Optimization and Search Problems · Auction Theory and Applications
