LLM-Enhanced Reinforcement Learning for Long-Term User Satisfaction in Interactive Recommendation

Chongjun Xia; Yanchun Peng; Xianzhi Wang

arXiv:2601.19585 · cs.IR · May 12, 2026

TL;DR

This paper introduces LERL, a hierarchical recommendation framework combining LLM-based semantic planning with reinforcement learning to enhance long-term user satisfaction and content diversity.

Contribution

It proposes a novel hierarchical approach that integrates large language models with reinforcement learning for improved long-term recommendation performance.

Findings

01 LERL significantly outperforms state-of-the-art baselines in long-term user satisfaction.

02 The hierarchical design reduces action space and improves planning efficiency.

03 Experiments on real-world datasets validate the effectiveness of LERL.
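
To make the action-space claim in finding 02 concrete, here is a minimal sketch of why a two-level policy faces fewer choices per decision than a flat one. Everything in it is an assumption for illustration, not the paper's implementation: the toy catalog, the idea that the high-level planner picks a category, and the helper names `flat_action_count` and `hierarchical_action_counts` are all hypothetical.

```python
from collections import defaultdict


def flat_action_count(catalog):
    """A flat policy must choose among every item at each step."""
    return len(catalog)


def hierarchical_action_counts(catalog):
    """Return (high-level choices, worst-case low-level choices).

    Assumption: the high-level planner picks one category, then the
    low-level policy picks one item inside that category.
    """
    items_by_category = defaultdict(list)
    for item, category in catalog.items():
        items_by_category[category].append(item)
    high_level = len(items_by_category)
    low_level = max(len(items) for items in items_by_category.values())
    return high_level, low_level


# Hypothetical toy catalog: 12 items spread evenly over 3 categories.
catalog = {f"item_{i}": f"cat_{i % 3}" for i in range(12)}

flat = flat_action_count(catalog)
high, low = hierarchical_action_counts(catalog)
print(f"flat: {flat} actions; hierarchical: {high} then at most {low}")
```

In this toy setup the flat policy scores 12 candidates per step, while the hierarchical one makes a 3-way choice followed by a 4-way choice. The gap widens as the catalog grows, since the two stages scale with the number of categories and the largest category size rather than with the full catalog.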

Abstract

Interactive recommender systems can dynamically adapt to user feedback, but often suffer from content homogeneity and filter bubble effects due to overfitting to short-term user preferences. While recent efforts aim to improve content diversity, they predominantly operate in static or one-shot settings, neglecting the long-term evolution of user interests. Reinforcement learning (RL) provides a principled framework for optimizing long-term user satisfaction by modeling sequential decision-making processes. However, its application in recommendation is hindered by sparse, long-tailed user-item interactions and limited semantic planning capabilities. In this work, we propose LLM-Enhanced Reinforcement Learning (LERL), a novel hierarchical recommendation framework that integrates the semantic planning power of large language models (LLMs) with the fine-grained adaptability of RL. LERL consists of a high-level LLM-based…

Code & Models

Repositories

1163710212/LERL (GitHub)