On Elimination Strategies for Bandit Fixed-Confidence Identification
Andrea Tirinzoni, R\'emy Degenne

TL;DR
This paper introduces an adaptive elimination strategy for bandit identification that combines the computational efficiency of elimination algorithms with the adaptiveness of fully adaptive methods, improving performance in complex settings.
Contribution
The authors propose a novel adaptive elimination approach that enhances existing strategies by integrating elimination into both stopping and sampling rules, applicable to complex combinatorial problems.
Findings
Elimination improves computational complexity in adaptive bandit algorithms.
The new method maintains or improves sample complexity compared to non-elimination strategies.
Experimental results show significant efficiency gains in linear bandit best-arm identification.
Abstract
Elimination algorithms for bandit identification, which prune the plausible correct answers sequentially until only one remains, are computationally convenient since they reduce the problem size over time. However, existing elimination strategies are often not fully adaptive (they update their sampling rule infrequently) and are not easy to extend to combinatorial settings, where the set of answers is exponentially large in the problem dimension. On the other hand, most existing fully-adaptive strategies to tackle general identification problems are computationally demanding since they repeatedly test the correctness of every answer, without ever reducing the problem size. We show that adaptive methods can be modified to use elimination in both their stopping and sampling rules, hence obtaining the best of these two worlds: the algorithms (1) remain fully adaptive, (2) suffer a sample…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Taxonomy
TopicsAdvanced Bandit Algorithms Research · Machine Learning and Algorithms · Mobile Crowdsensing and Crowdsourcing
