Continuously Learning Neural Dialogue Management

Pei-Hao Su; Milica Gasic; Nikola Mrksic; Lina Rojas-Barahona; Stefan; Ultes; David Vandyke; Tsung-Hsien Wen; Steve Young

arXiv:1606.02689·cs.CL·June 9, 2016·105 cites

Continuously Learning Neural Dialogue Management

Pei-Hao Su, Milica Gasic, Nikola Mrksic, Lina Rojas-Barahona, Stefan, Ultes, David Vandyke, Tsung-Hsien Wen, Steve Young

PDF

Open Access

TL;DR

This paper presents a neural dialogue management system that learns from data and continuously improves through reinforcement learning, enhancing task-oriented spoken dialogue performance in noisy environments.

Contribution

It introduces a unified neural network framework for supervised learning and reinforcement learning in dialogue management, enabling continuous improvement within a single model.

Findings

01

Supervised model performs well in corpus-based evaluation.

02

Reinforcement learning enhances performance in interactive settings.

03

Model is robust under high-noise conditions.

Abstract

We describe a two-step approach for dialogue management in task-oriented spoken dialogue systems. A unified neural network framework is proposed to enable the system to first learn by supervision from a set of dialogue data and then continuously improve its behaviour via reinforcement learning, all using gradient-based algorithms on one single model. The experiments demonstrate the supervised model's effectiveness in the corpus-based evaluation, with user simulation, and with paid human subjects. The use of reinforcement learning further improves the model's performance in both interactive settings, especially under higher-noise conditions.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsSpeech and dialogue systems · Topic Modeling · Intelligent Tutoring Systems and Adaptive Learning