On Elimination Strategies for Bandit Fixed-Confidence Identification

Andrea Tirinzoni; R\'emy Degenne

arXiv:2205.10936·cs.LG·October 25, 2022

On Elimination Strategies for Bandit Fixed-Confidence Identification

Andrea Tirinzoni, R\'emy Degenne

PDF

Open Access 1 Repo 1 Video

TL;DR

This paper introduces an adaptive elimination strategy for bandit identification that combines the computational efficiency of elimination algorithms with the adaptiveness of fully adaptive methods, improving performance in complex settings.

Contribution

The authors propose a novel adaptive elimination approach that enhances existing strategies by integrating elimination into both stopping and sampling rules, applicable to complex combinatorial problems.

Findings

01

Elimination improves computational complexity in adaptive bandit algorithms.

02

The new method maintains or improves sample complexity compared to non-elimination strategies.

03

Experimental results show significant efficiency gains in linear bandit best-arm identification.

Abstract

Elimination algorithms for bandit identification, which prune the plausible correct answers sequentially until only one remains, are computationally convenient since they reduce the problem size over time. However, existing elimination strategies are often not fully adaptive (they update their sampling rule infrequently) and are not easy to extend to combinatorial settings, where the set of answers is exponentially large in the problem dimension. On the other hand, most existing fully-adaptive strategies to tackle general identification problems are computationally demanding since they repeatedly test the correctness of every answer, without ever reducing the problem size. We show that adaptive methods can be modified to use elimination in both their stopping and sampling rules, hence obtaining the best of these two worlds: the algorithms (1) remain fully adaptive, (2) suffer a sample…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

andreatirinzoni/bandit-elimination
noneOfficial

Videos

On Elimination Strategies for Bandit Fixed-Confidence Identification· slideslive

Taxonomy

TopicsAdvanced Bandit Algorithms Research · Machine Learning and Algorithms · Mobile Crowdsensing and Crowdsourcing