Representative Arm Identification: A fixed confidence approach to identify cluster representatives
Sarvesh Gharat, Aniket Yadav, Nikhil Karamchandani, Jayakrishnan Nair

TL;DR
This paper introduces a fixed confidence method for identifying representative arms in multi-armed bandits, providing theoretical bounds and empirical evidence of improved efficiency over existing algorithms.
Contribution
It develops two confidence interval-based algorithms for the RAI problem, matching lower bounds and outperforming alternatives in experiments.
Findings
Proposed algorithms achieve near-optimal sample complexity.
Algorithms outperform existing methods in synthetic and real datasets.
Theoretical bounds confirm efficiency of the new approaches.
Abstract
We study the representative arm identification (RAI) problem in the multi-armed bandits (MAB) framework, wherein we have a collection of arms, each associated with an unknown reward distribution. An underlying instance is defined by a partitioning of the arms into clusters of predefined sizes, such that for any , all arms in cluster have a larger mean reward than those in cluster . The goal in RAI is to reliably identify a certain prespecified number of arms from each cluster, while using as few arm pulls as possible. The RAI problem covers as special cases several well-studied MAB problems such as identifying the best arm or any out of the top , as well as both full and coarse ranking. We start by providing an instance-dependent lower bound on the sample complexity of any feasible algorithm for this setting. We then propose two algorithms, based on the idea of…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMetallurgy and Cultural Artifacts
