Online Clustering of Bandits
Claudio Gentile, Shuai Li, Giovanni Zappella

TL;DR
This paper presents an adaptive clustering algorithm for bandit-based content recommendation, demonstrating improved prediction accuracy and scalability through theoretical analysis and experiments on artificial and real datasets.
Contribution
It introduces a new clustering approach for bandit algorithms, with rigorous regret analysis and empirical validation showing superior performance.
Findings
Significant increase in prediction accuracy over existing methods
Proven scalability of the proposed algorithm
Effective on both artificial and real-world datasets
Abstract
We introduce a novel algorithmic approach to content recommendation based on adaptive clustering of exploration-exploitation ("bandit") strategies. We provide a sharp regret analysis of this algorithm in a standard stochastic noise setting, demonstrate its scalability properties, and prove its effectiveness on a number of artificial and real-world datasets. Our experiments show a significant increase in prediction performance over state-of-the-art methods for bandit problems.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Bandit Algorithms Research · Data Stream Mining Techniques · Recommender Systems and Techniques
