Lusifer: LLM-based User SImulated Feedback Environment for online Recommender systems
Danial Ebrat, Eli Paradalis, Luis Rueda

TL;DR
Lusifer is a novel LLM-based simulation environment that generates dynamic, explainable user feedback for reinforcement learning recommender systems, addressing the limitations of static datasets and enhancing adaptability to real-world user behavior changes.
Contribution
This paper introduces Lusifer, a new environment leveraging large language models to simulate evolving user preferences and feedback, improving upon traditional static data approaches in recommender system training.
Findings
Lusifer achieves comparable predictive accuracy to collaborative filtering models.
It effectively captures dynamic user responses and provides explainable feedback.
The environment supports cold start scenarios and out-of-distribution adaptation.
Abstract
Reinforcement learning (RL) recommender systems often rely on static datasets that fail to capture the fluid, ever changing nature of user preferences in real-world scenarios. Meanwhile, generative AI techniques have emerged as powerful tools for creating synthetic data, including user profiles and behaviors. Recognizing this potential, we introduce Lusifer, an LLM-based simulation environment designed to generate dynamic, realistic user feedback for RL-based recommender training. In Lusifer, user profiles are incrementally updated at each interaction step, with Large Language Models (LLMs) providing transparent explanations of how and why preferences evolve. We focus on the MovieLens dataset, extracting only the last 40 interactions for each user, to emphasize recent behavior. By processing textual metadata (such as movie overviews and tags) Lusifer creates more context aware user…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsRecommender Systems and Techniques · Advanced Text Analysis Techniques · Image Retrieval and Classification Techniques
MethodsAttentive Walk-Aggregating Graph Neural Network · Focus
