Continuously Learning Neural Dialogue Management
Pei-Hao Su, Milica Gasic, Nikola Mrksic, Lina Rojas-Barahona, Stefan, Ultes, David Vandyke, Tsung-Hsien Wen, Steve Young

TL;DR
This paper presents a neural dialogue management system that learns from data and continuously improves through reinforcement learning, enhancing task-oriented spoken dialogue performance in noisy environments.
Contribution
It introduces a unified neural network framework for supervised learning and reinforcement learning in dialogue management, enabling continuous improvement within a single model.
Findings
Supervised model performs well in corpus-based evaluation.
Reinforcement learning enhances performance in interactive settings.
Model is robust under high-noise conditions.
Abstract
We describe a two-step approach for dialogue management in task-oriented spoken dialogue systems. A unified neural network framework is proposed to enable the system to first learn by supervision from a set of dialogue data and then continuously improve its behaviour via reinforcement learning, all using gradient-based algorithms on one single model. The experiments demonstrate the supervised model's effectiveness in the corpus-based evaluation, with user simulation, and with paid human subjects. The use of reinforcement learning further improves the model's performance in both interactive settings, especially under higher-noise conditions.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSpeech and dialogue systems · Topic Modeling · Intelligent Tutoring Systems and Adaptive Learning
