Online Interactive Collaborative Filtering Using Multi-Armed Bandit with Dependent Arms
Qing Wang, Chunqiu Zeng, Wubai Zhou, Tao Li, Larisa Shwartz, Genady, Ya. Grabarnik

TL;DR
This paper introduces a novel online collaborative filtering approach that models item dependencies as clusters with shared latent topics, improving recommendation accuracy in data-sparse environments.
Contribution
It develops a generative model for item dependencies and an efficient particle learning algorithm, integrating these with multi-armed bandit strategies for better online recommendations.
Findings
Effective in movie and news recommendation tasks
Outperforms baseline methods in accuracy and efficiency
Handles new users and items with limited interaction data
Abstract
Online interactive recommender systems strive to promptly suggest to consumers appropriate items (e.g., movies, news articles) according to the current context including both the consumer and item content information. However, such context information is often unavailable in practice for the recommendation, where only the users' interaction data on items can be utilized. Moreover, the lack of interaction records, especially for new users and items, worsens the performance of recommendation further. To address these issues, collaborative filtering (CF), one of the recommendation techniques relying on the interaction data only, as well as the online multi-armed bandit mechanisms, capable of achieving the balance between exploitation and exploration, are adopted in the online interactive recommendation settings, by assuming independent items (i.e., arms). Nonetheless, the assumption rarely…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
