Disagreement-Based Combinatorial Pure Exploration: Sample Complexity Bounds and an Efficient Algorithm
Tongyi Cao, Akshay Krishnamurthy

TL;DR
This paper introduces new algorithms for combinatorial pure exploration in multi-arm bandits, achieving improved sample complexity bounds and demonstrating their optimality and efficiency under certain conditions.
Contribution
The authors develop the first interactive algorithms with polynomial improvements in sample complexity for combinatorial pure exploration, supported by new theoretical bounds and efficient implementation methods.
Findings
Achieved polynomial improvements in sample complexity bounds.
Proved no uniform sampling approach can outperform their algorithms.
Provided efficient implementation for cases supporting linear optimization.
Abstract
We design new algorithms for the combinatorial pure exploration problem in the multi-arm bandit framework. In this problem, we are given distributions and a collection of subsets of these distributions, and we would like to find the subset that has largest mean, while collecting, in a sequential fashion, as few samples from the distributions as possible. In both the fixed budget and fixed confidence settings, our algorithms achieve new sample-complexity bounds that provide polynomial improvements on previous results in some settings. Via an information-theoretic lower bound, we show that no approach based on uniform sampling can improve on ours in any regime, yielding the first interactive algorithms for this problem with this basic property. Computationally, we show how to efficiently implement our fixed confidence algorithm…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Bandit Algorithms Research · Machine Learning and Algorithms · Optimization and Search Problems
