MUSE: Multi-Domain Chinese User Simulation via Self-Evolving Profiles and Rubric-Guided Alignment

Zihao Liu; Hantao Zhou; Jiguo Li; Jun Xu; Jiuchong Gao; Jinghua Hao; Renqing He; Peng Wang

arXiv:2604.13828·cs.CL·April 16, 2026

MUSE: Multi-Domain Chinese User Simulation via Self-Evolving Profiles and Rubric-Guided Alignment

Zihao Liu, Hantao Zhou, Jiguo Li, Jun Xu, Jiuchong Gao, Jinghua Hao, Renqing He, Peng Wang

PDF

TL;DR

MUSE is a multi-domain Chinese user simulation framework that uses self-evolving profiles and rubric-guided reinforcement learning to produce realistic, coherent, and persona-consistent responses over long interactions.

Contribution

The paper introduces MUSE, a novel multi-domain Chinese user simulator with self-evolving profiles and rubric-guided training, improving response realism and long-term consistency.

Findings

01

MUSE outperforms baselines in realism and coherence.

02

It maintains persona consistency over extended dialogues.

03

The framework enhances multi-turn response quality.

Abstract

User simulators are essential for the scalable training and evaluation of interactive AI systems. However, existing approaches often rely on shallow user profiling, struggle to maintain persona consistency over long interactions, and are largely limited to English or single-domain settings. We present MUSE, a multi-domain Chinese user simulation framework designed to generate human-like, controllable, and behaviorally consistent responses. First, we propose Iterative Profile Self-Evolution (IPSE), which gradually optimizes user profiles by comparing and reasoning discrepancies between simulated trajectories and real dialogue behaviors. We then apply Role-Reversal Supervised Fine-Tuning to improve local response realism and human-like expression. To enable fine-grained behavioral alignment, we further train a specialized rubric-based reward model and incorporate it into rubric-guided…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.