The Influence of Shape Constraints on the Thresholding Bandit Problem

James Cheshire; Pierre Menard; Alexandra Carpentier

arXiv:2006.10006 · cs.LG · February 24, 2021 · 1 citation

TL;DR

This paper studies how shape constraints like monotonicity, unimodality, and concavity affect the difficulty and optimal strategies for the Thresholding Bandit Problem, providing minimax regret rates for each case.

Contribution

It derives problem-independent minimax regret rates for TBP under various shape constraints and introduces algorithms tailored to each setting.

Findings

01

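The minimax regret rate depends on the shape constraint: it grows polynomially with the number of arms K for TBP and UTBP, but only logarithmically for MTBP and doubly logarithmically for CTBP.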

02

Shape constraints fundamentally alter the TBP's complexity.

03

Provided algorithms achieve the derived minimax rates.

Abstract

We investigate the stochastic Thresholding Bandit Problem (TBP) under several shape constraints. On top of (i) the vanilla, unstructured TBP, we consider (ii) the case where the sequence of arms' means $(\mu_k)_k$ is monotonically increasing (MTBP), (iii) the case where $(\mu_k)_k$ is unimodal (UTBP), and (iv) the case where $(\mu_k)_k$ is concave (CTBP). In the TBP, the aim is to output, at the end of the sequential game, the set of arms whose means are above a given threshold. The regret is the highest gap between a misclassified arm and the threshold. In the fixed budget setting, we provide problem-independent minimax rates for the expected regret in all settings, as well as associated algorithms. We prove that the minimax rates for the regret are (i) $\sqrt{\log(K) K / T}$ for TBP, (ii) $\sqrt{\log(K) / T}$ for MTBP, (iii) $\sqrt{K / T}$ for UTBP, and (iv) $\sqrt{\log\log K / T}$ for CTBP,…
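
To make the regret definition in the abstract concrete, here is a minimal Python sketch that computes it for one final classification. The function name `tbp_regret`, its arguments, and the example values are hypothetical and not taken from the paper. Treating an arm whose mean equals the threshold as "above" is an assumption made here; it does not change the regret, since such an arm has a gap of zero.

```python
from typing import Sequence, Set


def tbp_regret(means: Sequence[float], threshold: float,
               predicted_above: Set[int]) -> float:
    """Largest gap |mu_k - threshold| over misclassified arms (0.0 if none).

    `predicted_above` holds the indices of the arms that the learner
    outputs as lying above the threshold at the end of the game.
    """
    worst = 0.0
    for k, mu in enumerate(means):
        truly_above = mu >= threshold  # tie convention: an assumption of this sketch
        if truly_above != (k in predicted_above):
            worst = max(worst, abs(mu - threshold))
    return worst


# Illustrative values only: arm 1 is wrongly output as above the threshold
# (gap 0.25) and arm 2 is wrongly left out (gap 0.125).
means = [0.0, 0.25, 0.625, 1.0]
regret = tbp_regret(means, threshold=0.5, predicted_above={1, 3})
print(regret)
```

Only the worst misclassified arm counts, so an error on an arm very close to the threshold costs little. This is why problem-independent rates are possible in this setting.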

Taxonomy

Topics: Advanced Bandit Algorithms Research · Optimization and Search Problems · Auction Theory and Applications