LifeSim: Long-Horizon User Life Simulator for Personalized Assistant Evaluation
Feiyu Duan, Xuanjing Huang, Zhongyu Wei

TL;DR
LifeSim is a user simulation framework that models long-term user behaviors and cognition to evaluate personalized AI assistants more realistically across diverse scenarios.
Contribution
We introduce LifeSim, a novel user simulator based on the BDI model for realistic long-horizon interactions, and LifeSim-Eval, a comprehensive benchmark for personalized assistant evaluation.
Findings
Current LLMs struggle with implicit intentions.
Long-term user preference modeling remains challenging.
LifeSim reveals limitations of existing models in complex scenarios.
Abstract
The rapid advancement of large language models (LLMs) has accelerated progress toward universal AI assistants. However, existing benchmarks for personalized assistants remain misaligned with real-world user-assistant interactions, failing to capture the complexity of external contexts and users' cognitive states. To bridge this gap, we propose LifeSim, a user simulator that models user cognition through the Belief-Desire-Intention (BDI) model within physical environments for coherent life trajectories generation, and simulates intention-driven user interactive behaviors. Based on LifeSim, we introduce LifeSim-Eval, a comprehensive benchmark for multi-scenario, long-horizon personalized assistance. LifeSim-Eval covers 8 life domains and 1,200 diverse scenarios, and adopts a multi-turn interactive method to assess models' abilities to complete explicit and implicit intentions, recover…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSocial Robot Interaction and HRI · AI in Service Interactions · Multimodal Machine Learning Applications
