Fast Distributed Bandits for Online Recommendation Systems
Kanak Mahadik, Qingyun Wu, Shuai Li, and Amit Sabne

TL;DR
This paper introduces DistCLUB, a scalable distributed bandit algorithm for online recommendation systems that quickly discovers user clusters and significantly improves recommendation accuracy and speed over existing methods.
Contribution
DistCLUB is a novel distributed bandit algorithm that lazily creates clusters, reducing network load and accelerating cluster discovery compared to prior algorithms like DCCB.
Findings
DistCLUB is on average 8.87x faster than DCCB.
DistCLUB achieves 14.5% higher normalized prediction performance.
DistCLUB scales better with increasing users and items.
Abstract
Contextual bandit algorithms are commonly used in recommender systems, where content popularity can change rapidly. These algorithms continuously learn latent mappings between users and items, based on contexts associated with them both. Recent recommendation algorithms that learn clustering or social structures between users have exhibited higher recommendation accuracy. However, as the number of users and items in the environment increases, the time required to generate recommendations deteriorates significantly. As a result, these cannot be deployed in practice. The state-of-the-art distributed bandit algorithm - DCCB - relies on a peer-to-peer net-work to share information among distributed workers. However, this approach does not scale well with the increasing number of users. Furthermore, it suffers from slow discovery of clusters, resulting in accuracy degradation. To address the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsRecommender Systems and Techniques · Advanced Bandit Algorithms Research · Data Stream Mining Techniques
