Pure Exploration in Infinitely-Armed Bandit Models with Fixed-Confidence

Maryam Aziz; Jesse Anderton; Emilie Kaufmann (SEQUEL; CNRS; CRIStAL),; Javed Aslam

arXiv:1803.04665·stat.ML·May 22, 2018·6 cites

Pure Exploration in Infinitely-Armed Bandit Models with Fixed-Confidence

Maryam Aziz, Jesse Anderton, Emilie Kaufmann (SEQUEL, CNRS, CRIStAL),, Javed Aslam

PDF

Open Access

TL;DR

This paper studies the problem of identifying near-optimal arms in an infinitely-armed bandit setting without prior knowledge of the arm distribution, providing theoretical bounds and an efficient algorithm.

Contribution

It introduces a PAC-like framework, derives a lower bound, proposes an algorithm with near-optimal sample complexity, and discusses the fundamental limits of two-phase approaches.

Findings

01

Sample complexity lower bound established

02

Algorithm achieves near-optimal sample complexity within a log factor

03

Discussion on the inescapability of log^2(1/delta) dependence

Abstract

We consider the problem of near-optimal arm identification in the fixed confidence setting of the infinitely armed bandit problem when nothing is known about the arm reservoir distribution. We (1) introduce a PAC-like framework within which to derive and cast results; (2) derive a sample complexity lower bound for near-optimal arm identification; (3) propose an algorithm that identifies a nearly-optimal arm with high probability and derive an upper bound on its sample complexity which is within a log factor of our lower bound; and (4) discuss whether our log^2(1/delta) dependence is inescapable for "two-phase" (select arms first, identify the best later) algorithms in the infinite setting. This work permits the application of bandit models to a broader class of problems where fewer assumptions hold.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Bandit Algorithms Research · Machine Learning and Algorithms · Reinforcement Learning in Robotics