DIAL: Direct Iterative Adversarial Learning for Realistic Multi-Turn Dialogue Simulation

Ziyi Zhu; Olivier Tieleman; Caitlin A. Stamatis; Luka Smyth; Thomas D. Hull; Daniel R. Cahn; Jinghong Chen; Matteo Malgaroli

arXiv:2512.20773·cs.CL·May 7, 2026

DIAL: Direct Iterative Adversarial Learning for Realistic Multi-Turn Dialogue Simulation

Ziyi Zhu, Olivier Tieleman, Caitlin A. Stamatis, Luka Smyth, Thomas D. Hull, Daniel R. Cahn, Jinghong Chen, Matteo Malgaroli

PDF

TL;DR

DIAL is an adversarial learning framework that improves the realism of multi-turn dialogue user simulators, especially in sensitive domains like mental health support, by enhancing diversity and failure detection capabilities.

Contribution

Introduces DIAL, a novel adversarial approach that iteratively refines user simulators to better mimic human behavior and identify system failures.

Findings

01

Restores lexical diversity in simulated dialogues.

02

Reduces discriminator accuracy, indicating more realistic simulation.

03

Strong correlation between simulated and real failure rates.

Abstract

Realistic user simulation is crucial for training and evaluating multi-turn dialogue systems, yet creating simulators that accurately replicate human behavior remains a significant challenge. An effective simulator must expose the failure modes of the systems under evaluation. This work introduces Direct Iterative Adversarial Learning (DIAL), an adversarial framework that iteratively enhances user simulator realism through a competitive dynamic between a generator (user simulator) and a discriminator. When applied to mental health support, a domain characterized by diverse failure types and a critical dependence on realistic user behavior for failure detection, DIAL restores lexical diversity diminished by supervised fine-tuning and drastically reduces discriminator accuracy. The resulting simulator exhibits a strong correlation between simulated and real failure occurrence rates while…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.