AURO: Reinforcement Learning for Adaptive User Retention Optimization in   Recommender Systems

Zhenghai Xue; Qingpeng Cai; Bin Yang; Lantao Hu; Peng Jiang; Kun Gai,; Bo An

arXiv:2310.03984·cs.IR·February 27, 2025

AURO: Reinforcement Learning for Adaptive User Retention Optimization in Recommender Systems

Zhenghai Xue, Qingpeng Cai, Bin Yang, Lantao Hu, Peng Jiang, Kun Gai,, Bo An

PDF

Open Access

TL;DR

This paper presents AURO, a reinforcement learning approach that adapts to non-stationary user behaviors in recommender systems, improving retention by dynamically adjusting policies through state abstraction and performance-aware exploration.

Contribution

AURO introduces a novel state abstraction module and a performance-based loss function to handle environment non-stationarity in reinforcement learning for recommendations.

Findings

01

AURO outperforms baseline algorithms in user retention simulations.

02

AURO demonstrates superior performance on MovieLens and live video platforms.

03

The approach effectively adapts to evolving user behavior patterns.

Abstract

The field of Reinforcement Learning (RL) has garnered increasing attention for its ability of optimizing user retention in recommender systems. A primary obstacle in this optimization process is the environment non-stationarity stemming from the continual and complex evolution of user behavior patterns over time, such as variations in interaction rates and retention propensities. These changes pose significant challenges to existing RL algorithms for recommendations, leading to issues with dynamics and reward distribution shifts. This paper introduces a novel approach called \textbf{A}daptive \textbf{U}ser \textbf{R}etention \textbf{O}ptimization (AURO) to address this challenge. To navigate the recommendation policy in non-stationary environments, AURO introduces an state abstraction module in the policy network. The module is trained with a new value-based loss function, aligning its…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsRecommender Systems and Techniques