Large Language Model driven Policy Exploration for Recommender Systems

Jie Wang; Alexandros Karatzoglou; Ioannis Arapakis; Joemon M. Jose

arXiv:2501.13816·cs.IR·January 24, 2025

Large Language Model driven Policy Exploration for Recommender Systems

Jie Wang, Alexandros Karatzoglou, Ioannis Arapakis, Joemon M. Jose

PDF

Open Access

TL;DR

This paper introduces a novel RL-based recommender system approach that leverages Large Language Models to better understand user preferences, improve policy training, and adaptively balance exploration and exploitation in online environments.

Contribution

It proposes the Interaction-Augmented Learned Policy (iALP) using LLMs for pre-training and introduces adaptive variants A-iALP for online deployment, addressing distribution shift and exploration challenges.

Findings

01

A-iALP significantly improves recommendation performance in simulated environments.

02

Adaptive strategies effectively mitigate policy degradation and enhance exploration.

03

LLM-based pre-training enhances initial recommendation quality.

Abstract

Recent advancements in Recommender Systems (RS) have incorporated Reinforcement Learning (RL), framing the recommendation as a Markov Decision Process (MDP). However, offline RL policies trained on static user data are vulnerable to distribution shift when deployed in dynamic online environments. Additionally, excessive focus on exploiting short-term relevant items can hinder exploration, leading to suboptimal recommendations and negatively impacting long-term user gains. Online RL-based RS also face challenges in production deployment, due to the risks of exposing users to untrained or unstable policies. Large Language Models (LLMs) offer a promising solution to mimic user objectives and preferences for pre-training policies offline to enhance the initial recommendations in online settings. Effectively managing distribution shift and balancing exploration are crucial for improving…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsRecommender Systems and Techniques · Topic Modeling

MethodsFocus