Near Optimal Best Arm Identification for Clustered Bandits
Yash, Nikhil Karamchandani, Avishek Ghosh

TL;DR
This paper introduces two algorithms for best arm identification in clustered multi-agent bandits, achieving near-optimal sample complexity and communication efficiency, especially when the number of clusters is small.
Contribution
The paper proposes two novel algorithms, Cl-BAI and BAI-Cl, with theoretical guarantees and optimality results for clustered multi-armed bandit problems.
Findings
Both algorithms are $ ext{delta}$-probably correct with proven guarantees.
The sample complexity of BAI-Cl is minimax optimal when the number of clusters is small.
Experiments show superior performance in synthetic and real datasets.
Abstract
This work investigates the problem of best arm identification for multi-agent multi-armed bandits. We consider agents grouped into clusters, where each cluster solves a stochastic bandit problem. The mapping between agents and bandits is a priori unknown. Each bandit is associated with arms, and the goal is to identify the best arm for each agent under a -probably correct (-PC) framework, while minimizing sample complexity and communication overhead. We propose two novel algorithms: Clustering then Best Arm Identification (Cl-BAI) and Best Arm Identification then Clustering (BAI-Cl). Cl-BAI uses a two-phase approach that first clusters agents based on the bandit problems they are learning, followed by identifying the best arm for each cluster. BAI-Cl reverses the sequence by identifying the best arms first and then clustering agents accordingly. Both…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Bandit Algorithms Research · Distributed Sensor Networks and Detection Algorithms · Machine Learning and Algorithms
