Online Clustering of Data Sequences with Bandit Information
G Dhinesh Chandran, Srinivas Reddy Kota, Srikrishna Bhashyam

TL;DR
This paper introduces algorithms for efficiently clustering data sequences in a multi-armed bandit setting, achieving near-optimal sample complexity and error guarantees, with practical computational advantages.
Contribution
It proposes the ATBOC algorithm and variants like LUCBBOC and BOC-ELIM for online clustering with bandit feedback, extending theoretical guarantees to multivariate and exponential family distributions.
Findings
ATBOC is asymptotically order-optimal for Gaussian arms.
LUCBBOC and BOC-ELIM are computationally efficient with comparable performance.
Algorithms achieve $ ext{delta}$-probably correct clustering with validated asymptotic guarantees.
Abstract
We study the problem of online clustering of data sequences in the multi-armed bandit (MAB) framework under the fixed-confidence setting. There are arms, each providing i.i.d. samples from a parametric distribution whose parameters are unknown. The arms form clusters based on the distance between the true parameters. In the MAB setting, one arm can be sampled at each time. The objective is to estimate the clusters of the arms using as few samples as possible from the arms, subject to an upper bound on the error probability. Our setting allows for: arms within a cluster to have non-identical distributions, vector parameter arms, vector observations, and clusters. We propose and analyze the Average Tracking Bandit Online Clustering (ATBOC) algorithm. ATBOC is asymptotically order-optimal for multivariate Gaussian arms, with expected sample complexity grows at most…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Bandit Algorithms Research · Data Stream Mining Techniques · Recommender Systems and Techniques
