Domain-independent User Simulation with Transformers for Task-oriented   Dialogue Systems

Hsien-chin Lin; Nurul Lubis; Songbo Hu; Carel van Niekerk; Christian; Geishauser; Michael Heck; Shutong Feng; Milica Ga\v{s}i\'c

arXiv:2106.08838·cs.CL·June 17, 2021

Domain-independent User Simulation with Transformers for Task-oriented Dialogue Systems

Hsien-chin Lin, Nurul Lubis, Songbo Hu, Carel van Niekerk, Christian, Geishauser, Michael Heck, Shutong Feng, Milica Ga\v{s}i\'c

PDF

TL;DR

This paper introduces a domain-independent transformer-based user simulator for task-oriented dialogue systems that can generalize across domains without retraining, reducing the need for domain-specific data and rules.

Contribution

The authors propose a novel transformer-based user simulator that is domain-independent, enabling zero-shot generalization to new domains in dialogue systems.

Findings

01

TUS competes with rule-based simulators in known domains.

02

TUS generalizes to unseen domains without retraining.

03

Automatic and human evaluations confirm TUS's effectiveness.

Abstract

Dialogue policy optimisation via reinforcement learning requires a large number of training interactions, which makes learning with real users time consuming and expensive. Many set-ups therefore rely on a user simulator instead of humans. These user simulators have their own problems. While hand-coded, rule-based user simulators have been shown to be sufficient in small, simple domains, for complex domains the number of rules quickly becomes intractable. State-of-the-art data-driven user simulators, on the other hand, are still domain-dependent. This means that adaptation to each new domain requires redesigning and retraining. In this work, we propose a domain-independent transformer-based user simulator (TUS). The structure of our TUS is not tied to a specific domain, enabling domain generalisation and learning of cross-domain user behaviour from data. We compare TUS with the state of…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.