Neural User Simulation for Corpus-based Policy Optimisation for Spoken   Dialogue Systems

Florian Kreyssig; Inigo Casanueva; Pawel Budzianowski; Milica Gasic

arXiv:1805.06966·cs.CL·May 21, 2018

Neural User Simulation for Corpus-based Policy Optimisation for Spoken Dialogue Systems

Florian Kreyssig, Inigo Casanueva, Pawel Budzianowski, Milica Gasic

PDF

TL;DR

This paper introduces a neural user simulator that learns from dialogue corpora to generate natural language, improving policy training for spoken dialogue systems over traditional rule-based simulators.

Contribution

The paper presents a neural user simulator trained on dialogue data that produces natural language, enabling more effective policy training and evaluation for dialogue systems.

Findings

01

NUS outperforms ABUS in policy training tasks

02

Policies trained with NUS generalize better to real users

03

NUS requires less labeled data than semantic-based simulators

Abstract

User Simulators are one of the major tools that enable offline training of task-oriented dialogue systems. For this task the Agenda-Based User Simulator (ABUS) is often used. The ABUS is based on hand-crafted rules and its output is in semantic form. Issues arise from both properties such as limited diversity and the inability to interface a text-level belief tracker. This paper introduces the Neural User Simulator (NUS) whose behaviour is learned from a corpus and which generates natural language, hence needing a less labelled dataset than simulators generating a semantic output. In comparison to much of the past work on this topic, which evaluates user simulators on corpus-based metrics, we use the NUS to train the policy of a reinforcement learning based Spoken Dialogue System. The NUS is compared to the ABUS by evaluating the policies that were trained using the simulators.…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.