Online Clustering of Bandits

Claudio Gentile; Shuai Li; Giovanni Zappella

arXiv:1401.8257·cs.LG·June 9, 2014·97 cites

Online Clustering of Bandits

Claudio Gentile, Shuai Li, Giovanni Zappella

PDF

Open Access

TL;DR

This paper presents an adaptive clustering algorithm for bandit-based content recommendation, demonstrating improved prediction accuracy and scalability through theoretical analysis and experiments on artificial and real datasets.

Contribution

It introduces a new clustering approach for bandit algorithms, with rigorous regret analysis and empirical validation showing superior performance.

Findings

01

Significant increase in prediction accuracy over existing methods

02

Proven scalability of the proposed algorithm

03

Effective on both artificial and real-world datasets

Abstract

We introduce a novel algorithmic approach to content recommendation based on adaptive clustering of exploration-exploitation ("bandit") strategies. We provide a sharp regret analysis of this algorithm in a standard stochastic noise setting, demonstrate its scalability properties, and prove its effectiveness on a number of artificial and real-world datasets. Our experiments show a significant increase in prediction performance over state-of-the-art methods for bandit problems.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Bandit Algorithms Research · Data Stream Mining Techniques · Recommender Systems and Techniques