Optimal Algorithms for Range Searching over Multi-Armed Bandits

Siddharth Barman; Ramakrishnan Krishnamurthy; Saladi Rahul

arXiv:2105.01390·cs.LG·May 5, 2021

Optimal Algorithms for Range Searching over Multi-Armed Bandits

Siddharth Barman, Ramakrishnan Krishnamurthy, Saladi Rahul

PDF

TL;DR

This paper introduces efficient algorithms for range searching in multi-armed bandit problems, leveraging geometric hitting sets to achieve near-optimal sample complexities with high-probability guarantees.

Contribution

It presents the first sample-efficient algorithms for range searching with stochastic weights in MABs, including multi-dimensional extensions and tight lower bounds.

Findings

01

Algorithms achieve PAC guarantees with near-optimal sample complexity.

02

Sample complexity depends on the size of the optimal hitting set.

03

Lower bounds show the algorithms are essentially tight.

Abstract

This paper studies a multi-armed bandit (MAB) version of the range-searching problem. In its basic form, range searching considers as input a set of points (on the real line) and a collection of (real) intervals. Here, with each specified point, we have an associated weight, and the problem objective is to find a maximum-weight point within every given interval. The current work addresses range searching with stochastic weights: each point corresponds to an arm (that admits sample access) and the point's weight is the (unknown) mean of the underlying distribution. In this MAB setup, we develop sample-efficient algorithms that find, with high probability, near-optimal arms within the given intervals, i.e., we obtain PAC (probably approximately correct) guarantees. We also provide an algorithm for a generalization wherein the weight of each point is a multi-dimensional vector. The…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.