Neural User Simulation for Corpus-based Policy Optimisation for Spoken Dialogue Systems
Florian Kreyssig, Inigo Casanueva, Pawel Budzianowski, Milica Gasic

TL;DR
This paper introduces a neural user simulator that learns from dialogue corpora to generate natural language, improving policy training for spoken dialogue systems over traditional rule-based simulators.
Contribution
The paper presents a neural user simulator trained on dialogue data that produces natural language, enabling more effective policy training and evaluation for dialogue systems.
Findings
NUS outperforms ABUS in policy training tasks
Policies trained with NUS generalize better to real users
NUS requires less labeled data than semantic-based simulators
Abstract
User Simulators are one of the major tools that enable offline training of task-oriented dialogue systems. For this task the Agenda-Based User Simulator (ABUS) is often used. The ABUS is based on hand-crafted rules and its output is in semantic form. Issues arise from both properties such as limited diversity and the inability to interface a text-level belief tracker. This paper introduces the Neural User Simulator (NUS) whose behaviour is learned from a corpus and which generates natural language, hence needing a less labelled dataset than simulators generating a semantic output. In comparison to much of the past work on this topic, which evaluates user simulators on corpus-based metrics, we use the NUS to train the policy of a reinforcement learning based Spoken Dialogue System. The NUS is compared to the ABUS by evaluating the policies that were trained using the simulators.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
