Know You First and Be You Better: Modeling Human-Like User Simulators via Implicit Profiles

Kuang Wang; Xianfei Li; Shenghao Yang; Li Zhou; Feng Jiang; Haizhou Li

arXiv:2502.18968·cs.CL·July 1, 2025

Know You First and Be You Better: Modeling Human-Like User Simulators via Implicit Profiles

Kuang Wang, Xianfei Li, Shenghao Yang, Li Zhou, Feng Jiang, Haizhou Li

PDF

1 Repo 2 Models 1 Datasets 1 Video

TL;DR

This paper introduces USP, a novel user simulator that infers implicit user profiles from interactions, enhancing realism, diversity, and consistency in dialogue simulation for training and evaluation of language models.

Contribution

We propose a framework that infers implicit user profiles from interactions and refines simulation with supervised fine-tuning and reinforcement learning, improving authenticity and diversity.

Findings

01

USP outperforms baselines in authenticity and diversity.

02

USP maintains conversation-level consistency.

03

Effective in evaluating LLMs on real-world benchmarks.

Abstract

User simulators are crucial for replicating human interactions with dialogue systems, supporting both collaborative training and automatic evaluation, especially for large language models (LLMs). However, current role-playing methods face challenges such as a lack of utterance-level authenticity and user-level diversity, often hindered by role confusion and dependence on predefined profiles of well-known figures. In contrast, direct simulation focuses solely on text, neglecting implicit user traits like personality and conversation-level consistency. To address these issues, we introduce the User Simulator with Implicit Profiles (USP), a framework that infers implicit user profiles from human-machine interactions to simulate personalized and realistic dialogues. We first develop an LLM-driven extractor with a comprehensive profile schema, then refine the simulation using conditional…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

wangkevin02/USP
pytorchOfficial

Models

Datasets

wangkevin02/LMSYS-USP
dataset· 93 dl
93 dl

Videos

Know You First and Be You Better: Modeling Human-Like User Simulators via Implicit Profiles· underline

Taxonomy

MethodsALIGN · ADaptive gradient method with the OPTimal convergence rate