AlignUSER: Human-Aligned LLM Agents via World Models for Recommender System Evaluation
Nicolas Bougie, Gian Maria Marconi, Tony Yip, and Narimasa Watanabe

TL;DR
AlignUSER introduces a novel framework that uses world models and human interaction data to create LLM-based agents that better mimic human behavior for evaluating recommender systems.
Contribution
It presents a new approach to training LLM agents with world models and counterfactual reasoning, improving their alignment with human preferences in recommender system evaluation.
Findings
AlignUSER achieves closer alignment with human behavior than previous methods.
The framework improves the fidelity of synthetic user interactions.
It demonstrates effectiveness across multiple datasets.
Abstract
Evaluating recommender systems remains challenging due to the gap between offline metrics and real user behavior, as well as the scarcity of interaction data. Recent work explores large language model (LLM) agents as synthetic users, yet they typically rely on few-shot prompting, which yields a shallow understanding of the environment and limits their ability to faithfully reproduce user actions. We introduce AlignUSER, a framework that learns world-model-driven agents from human interactions. Given rollout sequences of actions and states, we formalize world modeling as a next state prediction task that helps the agent internalize the environment. To align actions with human personas, we generate counterfactual trajectories around demonstrations and prompt the LLM to compare its decisions with human choices, identify suboptimal actions, and extract lessons. The learned policy is then…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsPersona Design and Applications · Multimodal Machine Learning Applications · Topic Modeling
