Large Language Model driven Policy Exploration for Recommender Systems
Jie Wang, Alexandros Karatzoglou, Ioannis Arapakis, Joemon M. Jose

TL;DR
This paper introduces a novel RL-based recommender system approach that leverages Large Language Models to better understand user preferences, improve policy training, and adaptively balance exploration and exploitation in online environments.
Contribution
It proposes the Interaction-Augmented Learned Policy (iALP) using LLMs for pre-training and introduces adaptive variants A-iALP for online deployment, addressing distribution shift and exploration challenges.
Findings
A-iALP significantly improves recommendation performance in simulated environments.
Adaptive strategies effectively mitigate policy degradation and enhance exploration.
LLM-based pre-training enhances initial recommendation quality.
Abstract
Recent advancements in Recommender Systems (RS) have incorporated Reinforcement Learning (RL), framing the recommendation as a Markov Decision Process (MDP). However, offline RL policies trained on static user data are vulnerable to distribution shift when deployed in dynamic online environments. Additionally, excessive focus on exploiting short-term relevant items can hinder exploration, leading to suboptimal recommendations and negatively impacting long-term user gains. Online RL-based RS also face challenges in production deployment, due to the risks of exposing users to untrained or unstable policies. Large Language Models (LLMs) offer a promising solution to mimic user objectives and preferences for pre-training policies offline to enhance the initial recommendations in online settings. Effectively managing distribution shift and balancing exploration are crucial for improving…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsRecommender Systems and Techniques · Topic Modeling
MethodsFocus
