Disagreement-Based Combinatorial Pure Exploration: Sample Complexity   Bounds and an Efficient Algorithm

Tongyi Cao; Akshay Krishnamurthy

arXiv:1711.08018·stat.ML·May 29, 2019·1 cites

Disagreement-Based Combinatorial Pure Exploration: Sample Complexity Bounds and an Efficient Algorithm

Tongyi Cao, Akshay Krishnamurthy

PDF

Open Access

TL;DR

This paper introduces new algorithms for combinatorial pure exploration in multi-arm bandits, achieving improved sample complexity bounds and demonstrating their optimality and efficiency under certain conditions.

Contribution

The authors develop the first interactive algorithms with polynomial improvements in sample complexity for combinatorial pure exploration, supported by new theoretical bounds and efficient implementation methods.

Findings

01

Achieved polynomial improvements in sample complexity bounds.

02

Proved no uniform sampling approach can outperform their algorithms.

03

Provided efficient implementation for cases supporting linear optimization.

Abstract

We design new algorithms for the combinatorial pure exploration problem in the multi-arm bandit framework. In this problem, we are given $K$ distributions and a collection of subsets $V \subset 2^{[K]}$ of these distributions, and we would like to find the subset $v \in V$ that has largest mean, while collecting, in a sequential fashion, as few samples from the distributions as possible. In both the fixed budget and fixed confidence settings, our algorithms achieve new sample-complexity bounds that provide polynomial improvements on previous results in some settings. Via an information-theoretic lower bound, we show that no approach based on uniform sampling can improve on ours in any regime, yielding the first interactive algorithms for this problem with this basic property. Computationally, we show how to efficiently implement our fixed confidence algorithm…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Bandit Algorithms Research · Machine Learning and Algorithms · Optimization and Search Problems