Efficient Clustering in Stochastic Bandits
G Dhinesh Chandran, Kota Srinivas Reddy, Srikrishna Bhashyam

TL;DR
This paper introduces an efficient algorithm for bandit clustering that groups data sequences into clusters with fixed confidence, handling diverse distributions and reducing computational costs while maintaining asymptotic optimality.
Contribution
The paper proposes EBC, an efficient, asymptotically optimal bandit clustering algorithm that simplifies the sampling rule, and a heuristic variant EBC-H, improving computational efficiency.
Findings
EBC achieves asymptotic optimality in synthetic datasets.
EBC and EBC-H outperform existing algorithms in computational efficiency.
Both algorithms show performance gains on real-world datasets.
Abstract
We study the Bandit Clustering (BC) problem under the fixed confidence setting, where the objective is to group a collection of data sequences (arms) into clusters through sequential sampling from adaptively selected arms at each time step while ensuring a fixed error probability at the stopping time. We consider a setting where arms in a cluster may have different distributions. Unlike existing results in this setting, which assume Gaussian-distributed arms, we study a broader class of vector-parametric distributions that satisfy mild regularity conditions. Existing asymptotically optimal BC algorithms require solving an optimization problem as part of their sampling rule at each step, which is computationally costly. We propose an Efficient Bandit Clustering algorithm (EBC), which, instead of solving the full optimization problem, takes a single step toward the optimal value at each…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Bandit Algorithms Research · Stochastic Gradient Optimization Techniques · Mobile Crowdsensing and Crowdsourcing
