TL;DR
This paper formally defines distinct objectives for user simulation in conversational information access, highlighting the importance of aligning simulator design with specific goals for training or evaluation.
Contribution
It introduces a formal characterization of user simulation objectives and demonstrates the need for tailored design based on the intended use.
Findings
Optimizing for training does not improve evaluation performance.
Clear objectives and measures are essential for effective user simulators.
Tailored simulators enhance the development of conversational agents.
Abstract
User simulation is a promising approach for automatically training and evaluating conversational information access agents, enabling the generation of synthetic dialogues and facilitating reproducible experiments at scale. However, the objectives of user simulation for the different uses remain loosely defined, hindering the development of effective simulators. In this work, we formally characterize the distinct objectives for user simulators: training aims to maximize behavioral similarity to real users, while evaluation focuses on the accurate prediction of real-world conversational agent performance. Through an empirical study, we demonstrate that optimizing for one objective does not necessarily lead to improved performance on the other. This finding underscores the need for tailored design considerations depending on the intended use of the simulator. By establishing clear…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
