Finding All {\epsilon}-Good Arms in Stochastic Bandits

Blake Mason; Lalit Jain; Ardhendu Tripathy; and Robert Nowak

arXiv:2006.08850·stat.ML·September 14, 2020·6 cites

Finding All {\epsilon}-Good Arms in Stochastic Bandits

Blake Mason, Lalit Jain, Ardhendu Tripathy, and Robert Nowak

PDF

Open Access 1 Repo 1 Video

TL;DR

This paper introduces algorithms for identifying all psilon-good arms in stochastic bandits, addressing a previously overlooked problem with significant practical applications and demonstrating strong empirical results.

Contribution

The paper presents the first algorithms specifically designed for all-psilon-good arm identification in stochastic bandits, a new challenge in pure-exploration.

Findings

01

Algorithms outperform baselines on large-scale datasets

02

Effective in applications like drug screening and content rating

03

Addresses a novel problem in bandit theory

Abstract

The pure-exploration problem in stochastic multi-armed bandits aims to find one or more arms with the largest (or near largest) means. Examples include finding an {\epsilon}-good arm, best-arm identification, top-k arm identification, and finding all arms with means above a specified threshold. However, the problem of finding all {\epsilon}-good arms has been overlooked in past work, although arguably this may be the most natural objective in many applications. For example, a virologist may conduct preliminary laboratory experiments on a large candidate set of treatments and move all {\epsilon}-good treatments into more expensive clinical trials. Since the ultimate clinical efficacy is uncertain, it is important to identify all {\epsilon}-good candidates. Mathematically, the all-{\epsilon}-good arm identification problem presents significant new challenges and surprises that do not…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

blakemas/all_epsilon
none

Videos

Finding All $\epsilon$-Good Arms in Stochastic Bandits· slideslive

Taxonomy

TopicsAdvanced Bandit Algorithms Research · Optimization and Search Problems · Reinforcement Learning in Robotics