TL;DR
This paper introduces a neural network-based exploration policy for interactive collaborative filtering, enabling recommender systems to adapt quickly to user preferences and improve long-term satisfaction, especially for cold-start and evolving users.
Contribution
It proposes a novel neural exploration policy trained with reinforcement learning to enhance interactive collaborative filtering performance.
Findings
Outperforms state-of-the-art methods on benchmark datasets.
Effectively balances exploration and exploitation in recommendations.
Improves user satisfaction in cold-start and dynamic user scenarios.
Abstract
In this paper, we study collaborative filtering in an interactive setting, in which the recommender agents iterate between making recommendations and updating the user profile based on the interactive feedback. The most challenging problem in this scenario is how to suggest items when the user profile has not been well established, i.e., recommend for cold-start users or warm-start users with taste drifting. Existing approaches either rely on overly pessimistic linear exploration strategy or adopt meta-learning based algorithms in a full exploitation way. In this work, to quickly catch up with the user's interests, we propose to represent the exploration policy with a neural network and directly learn it from the feedback data. Specifically, the exploration policy is encoded in the weights of multi-channel stacked self-attention neural networks and trained with efficient Q-learning by…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
MethodsQ-Learning
