Reinforcement Learning-based Recommender Systems with Large Language   Models for State Reward and Action Modeling

Jie Wang; Alexandros Karatzoglou; Ioannis Arapakis; Joemon M. Jose

arXiv:2403.16948·cs.IR·March 26, 2024·2 cites

Reinforcement Learning-based Recommender Systems with Large Language Models for State Reward and Action Modeling

Jie Wang, Alexandros Karatzoglou, Ioannis Arapakis, Joemon M. Jose

PDF

Open Access

TL;DR

This paper introduces a novel approach that leverages large language models as an environment to improve reinforcement learning-based recommender systems by better modeling user states and rewards, especially in offline settings.

Contribution

It proposes using large language models as a state and reward model to enhance offline RL recommenders, reducing data needs and synthesizing user feedback.

Findings

01

Improved recommendation accuracy on benchmark datasets.

02

Effective user state and reward modeling with LLMs.

03

Enhanced data augmentation through generated positive actions.

Abstract

Reinforcement Learning (RL)-based recommender systems have demonstrated promising performance in meeting user expectations by learning to make accurate next-item recommendations from historical user-item interactions. However, existing offline RL-based sequential recommendation methods face the challenge of obtaining effective user feedback from the environment. Effectively modeling the user state and shaping an appropriate reward for recommendation remains a challenge. In this paper, we leverage language understanding capabilities and adapt large language models (LLMs) as an environment (LE) to enhance RL-based recommenders. The LE is learned from a subset of user-item interaction data, thus reducing the need for large training data, and can synthesise user feedback for offline data by: (i) acting as a state model that produces high quality states that enrich the user representation,…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Text Analysis Techniques · Recommender Systems and Techniques