An Empirical Process Approach to the Union Bound: Practical Algorithms   for Combinatorial and Linear Bandits

Julian Katz-Samuels; Lalit Jain; Zohar Karnin; Kevin Jamieson

arXiv:2006.11685·cs.LG·June 23, 2020·20 cites

An Empirical Process Approach to the Union Bound: Practical Algorithms for Combinatorial and Linear Bandits

Julian Katz-Samuels, Lalit Jain, Zohar Karnin, Kevin Jamieson

PDF

Open Access 1 Video

TL;DR

This paper introduces near-optimal algorithms for linear bandit pure-exploration problems, utilizing empirical process theory to improve sample complexity bounds and extend to fixed budget settings, especially for combinatorial classes.

Contribution

It develops a novel experimental design based on Gaussian-width, providing tight bounds and efficient algorithms for combinatorial linear bandits in both fixed confidence and fixed budget scenarios.

Findings

01

Sample complexity scales with instance geometry

02

Algorithm matches lower bounds in fixed confidence setting

03

First fixed budget linear bandit algorithm with near-optimal guarantees

Abstract

This paper proposes near-optimal algorithms for the pure-exploration linear bandit problem in the fixed confidence and fixed budget settings. Leveraging ideas from the theory of suprema of empirical processes, we provide an algorithm whose sample complexity scales with the geometry of the instance and avoids an explicit union bound over the number of arms. Unlike previous approaches which sample based on minimizing a worst-case variance (e.g. G-optimal design), we define an experimental design objective based on the Gaussian-width of the underlying arm set. We provide a novel lower bound in terms of this objective that highlights its fundamental role in the sample complexity. The sample complexity of our fixed confidence algorithm matches this lower bound, and in addition is computationally efficient for combinatorial classes, e.g. shortest-path, matchings and matroids, where the arm…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

An Empirical Process Approach to the Union Bound: Practical Algorithms for Combinatorial and Linear Bandits· slideslive

Taxonomy

TopicsAdvanced Bandit Algorithms Research · Machine Learning and Algorithms · Optimization and Search Problems