Max K-armed bandit: On the ExtremeHunter algorithm and beyond

Mastane Achab; Stephan Cl\'emen\c{c}on; Aur\'elien Garivier; Anne; Sabourin; Claire Vernade

arXiv:1707.08820·stat.ML·July 28, 2017

Max K-armed bandit: On the ExtremeHunter algorithm and beyond

Mastane Achab, Stephan Cl\'emen\c{c}on, Aur\'elien Garivier, Anne, Sabourin, Claire Vernade

PDF

Open Access

TL;DR

This paper advances the understanding of the max K-armed bandit problem by refining the analysis of the ExtremeHunter algorithm and proposing a reduction to classical bandit problems, supported by numerical comparisons.

Contribution

It refines the analysis of ExtremeHunter and introduces a reduction of Extreme Bandits to classical bandit problems, offering new theoretical insights.

Findings

01

Refined analysis of ExtremeHunter algorithm

02

Proposed reduction of Extreme Bandits to classical bandit problems

03

Numerical experiments comparing approaches

Abstract

This paper is devoted to the study of the max K-armed bandit problem, which consists in sequentially allocating resources in order to detect extreme values. Our contribution is twofold. We first significantly refine the analysis of the ExtremeHunter algorithm carried out in Carpentier and Valko (2014), and next propose an alternative approach, showing that, remarkably, Extreme Bandits can be reduced to a classical version of the bandit problem to a certain extent. Beyond the formal analysis, these two approaches are compared through numerical experiments.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Bandit Algorithms Research · Risk and Portfolio Optimization · Optimization and Search Problems