AURO: Reinforcement Learning for Adaptive User Retention Optimization in Recommender Systems
Zhenghai Xue, Qingpeng Cai, Bin Yang, Lantao Hu, Peng Jiang, Kun Gai,, Bo An

TL;DR
This paper presents AURO, a reinforcement learning approach that adapts to non-stationary user behaviors in recommender systems, improving retention by dynamically adjusting policies through state abstraction and performance-aware exploration.
Contribution
AURO introduces a novel state abstraction module and a performance-based loss function to handle environment non-stationarity in reinforcement learning for recommendations.
Findings
AURO outperforms baseline algorithms in user retention simulations.
AURO demonstrates superior performance on MovieLens and live video platforms.
The approach effectively adapts to evolving user behavior patterns.
Abstract
The field of Reinforcement Learning (RL) has garnered increasing attention for its ability of optimizing user retention in recommender systems. A primary obstacle in this optimization process is the environment non-stationarity stemming from the continual and complex evolution of user behavior patterns over time, such as variations in interaction rates and retention propensities. These changes pose significant challenges to existing RL algorithms for recommendations, leading to issues with dynamics and reward distribution shifts. This paper introduces a novel approach called \textbf{A}daptive \textbf{U}ser \textbf{R}etention \textbf{O}ptimization (AURO) to address this challenge. To navigate the recommendation policy in non-stationary environments, AURO introduces an state abstraction module in the policy network. The module is trained with a new value-based loss function, aligning its…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsRecommender Systems and Techniques
