Optimal Algorithms for Range Searching over Multi-Armed Bandits
Siddharth Barman, Ramakrishnan Krishnamurthy, Saladi Rahul

TL;DR
This paper introduces efficient algorithms for range searching in multi-armed bandit problems, leveraging geometric hitting sets to achieve near-optimal sample complexities with high-probability guarantees.
Contribution
It presents the first sample-efficient algorithms for range searching with stochastic weights in MABs, including multi-dimensional extensions and tight lower bounds.
Findings
Algorithms achieve PAC guarantees with near-optimal sample complexity.
Sample complexity depends on the size of the optimal hitting set.
Lower bounds show the algorithms are essentially tight.
Abstract
This paper studies a multi-armed bandit (MAB) version of the range-searching problem. In its basic form, range searching considers as input a set of points (on the real line) and a collection of (real) intervals. Here, with each specified point, we have an associated weight, and the problem objective is to find a maximum-weight point within every given interval. The current work addresses range searching with stochastic weights: each point corresponds to an arm (that admits sample access) and the point's weight is the (unknown) mean of the underlying distribution. In this MAB setup, we develop sample-efficient algorithms that find, with high probability, near-optimal arms within the given intervals, i.e., we obtain PAC (probably approximately correct) guarantees. We also provide an algorithm for a generalization wherein the weight of each point is a multi-dimensional vector. The…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
