Domain-independent User Simulation with Transformers for Task-oriented Dialogue Systems
Hsien-chin Lin, Nurul Lubis, Songbo Hu, Carel van Niekerk, Christian, Geishauser, Michael Heck, Shutong Feng, Milica Ga\v{s}i\'c

TL;DR
This paper introduces a domain-independent transformer-based user simulator for task-oriented dialogue systems that can generalize across domains without retraining, reducing the need for domain-specific data and rules.
Contribution
The authors propose a novel transformer-based user simulator that is domain-independent, enabling zero-shot generalization to new domains in dialogue systems.
Findings
TUS competes with rule-based simulators in known domains.
TUS generalizes to unseen domains without retraining.
Automatic and human evaluations confirm TUS's effectiveness.
Abstract
Dialogue policy optimisation via reinforcement learning requires a large number of training interactions, which makes learning with real users time consuming and expensive. Many set-ups therefore rely on a user simulator instead of humans. These user simulators have their own problems. While hand-coded, rule-based user simulators have been shown to be sufficient in small, simple domains, for complex domains the number of rules quickly becomes intractable. State-of-the-art data-driven user simulators, on the other hand, are still domain-dependent. This means that adaptation to each new domain requires redesigning and retraining. In this work, we propose a domain-independent transformer-based user simulator (TUS). The structure of our TUS is not tied to a specific domain, enabling domain generalisation and learning of cross-domain user behaviour from data. We compare TUS with the state of…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
