Hierarchical Reinforcement Learning for Modeling User Novelty-Seeking Intent in Recommender Systems
Pan Li, Yuyan Wang, Ed H. Chi, Minmin Chen

TL;DR
This paper introduces a hierarchical reinforcement learning approach to model and adapt to users' layered novelty-seeking behaviors in recommender systems, improving long-term user engagement by balancing exploration and preference.
Contribution
The paper presents a novel hierarchical RL method that explicitly models static and dynamic aspects of user novelty-seeking intent for personalized recommendations.
Findings
Hierarchical RL effectively captures user novelty-seeking intent.
The proposed method outperforms state-of-the-art baselines.
Incorporating diversity and novelty in reward improves exploration.
Abstract
Recommending novel content, which expands user horizons by introducing them to new interests, has been shown to improve users' long-term experience on recommendation platforms \cite{chen2021values}. Users however are not constantly looking to explore novel content. It is therefore crucial to understand their novelty-seeking intent and adjust the recommendation policy accordingly. Most existing literature models a user's propensity to choose novel content or to prefer a more diverse set of recommendations at individual interactions. Hierarchical structure, on the other hand, exists in a user's novelty-seeking intent, which is manifested as a static and intrinsic user preference for seeking novelty along with a dynamic session-based propensity. To this end, we propose a novel hierarchical reinforcement learning-based method to model the hierarchical user novelty-seeking intent, and to…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsRecommender Systems and Techniques · Advanced Bandit Algorithms Research · Sentiment Analysis and Opinion Mining
