Improved Algorithm on Online Clustering of Bandits

Shuai Li; Wei Chen; Shuai Li; Kwong-Sak Leung

arXiv:1902.09162·cs.LG·July 3, 2019·6 cites

Improved Algorithm on Online Clustering of Bandits

Shuai Li, Wei Chen, Shuai Li, Kwong-Sak Leung

PDF

Open Access

TL;DR

This paper introduces a generalized online clustering algorithm for bandits that handles non-uniform user distributions, offering improved efficiency and regret bounds, validated through experiments on synthetic and real data.

Contribution

It presents a novel algorithm for online clustering of bandits that accommodates non-uniform user frequencies and provides theoretical regret guarantees.

Findings

01

The new algorithm outperforms existing methods in experiments.

02

Regret bounds are independent of minimal user frequency.

03

Algorithm demonstrates consistent advantages on synthetic and real datasets.

Abstract

We generalize the setting of online clustering of bandits by allowing non-uniform distribution over user frequencies. A more efficient algorithm is proposed with simple set structures to represent clusters. We prove a regret bound for the new algorithm which is free of the minimal frequency over users. The experiments on both synthetic and real datasets consistently show the advantage of the new algorithm over existing methods.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Bandit Algorithms Research · Recommender Systems and Techniques · Data Stream Mining Techniques