Dialogue Learning with Human Teaching and Feedback in End-to-End Trainable Task-Oriented Dialogue Systems

Bing Liu; Gokhan Tur; Dilek Hakkani-Tur; Pararth Shah; Larry Heck

arXiv:1804.06512 · cs.CL · April 19, 2018 · 20 cites

TL;DR

This paper introduces a hybrid imitation and reinforcement learning approach for training task-oriented dialogue systems through online human interactions, improving learning efficiency and task success.

Contribution

It proposes a novel hybrid learning method combining imitation and reinforcement learning for end-to-end dialogue systems trained via human feedback.

Findings

01  Effective learning from user teaching via imitation learning.
02  Reinforcement learning with user feedback enhances task completion.
03  End-to-end neural dialogue agent improves with the proposed method.

Abstract

In this work, we present a hybrid learning method for training task-oriented dialogue systems through online user interactions. Popular methods for learning task-oriented dialogues include applying reinforcement learning with user feedback to models pre-trained with supervised learning. The efficiency of such learning methods may suffer from the mismatch in dialogue state distribution between the offline training and online interactive learning stages. To address this challenge, we propose a hybrid imitation and reinforcement learning method, with which a dialogue agent can effectively learn from its interactions with users by learning from human teaching and feedback. We design a neural network based task-oriented dialogue agent that can be optimized end-to-end with the proposed learning method. Experimental results show that our end-to-end dialogue agent can learn effectively from the mistake it makes via…
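
To make the hybrid idea in the abstract concrete, here is a minimal sketch of one way imitation and reinforcement updates can share a single policy. It is not the paper's neural dialogue agent: the tabular softmax policy, the toy sizes, the `teacher` mapping, the reward values, and the schedule (teaching episodes first, feedback episodes after) are all assumptions made for illustration. The one feature it tries to preserve is that teaching happens on states the agent itself visits, so corrections address the agent's own mistakes.

```python
import math
import random

N_STATES, N_ACTIONS = 3, 4  # hypothetical toy sizes, not from the paper


def softmax(logits):
    """Numerically stable softmax over a list of floats."""
    m = max(logits)
    exps = [math.exp(x - m) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]


class TabularPolicy:
    """Softmax policy with one row of logits per state (illustrative only)."""

    def __init__(self):
        self.logits = [[0.0] * N_ACTIONS for _ in range(N_STATES)]

    def probs(self, state):
        return softmax(self.logits[state])

    def sample(self, state, rng):
        return rng.choices(range(N_ACTIONS), weights=self.probs(state))[0]

    def _nudge(self, state, action, scale):
        # Gradient of log pi(action | state) w.r.t. the logits is
        # one_hot(action) - probs; move along it by `scale`.
        p = self.probs(state)
        for a in range(N_ACTIONS):
            grad = (1.0 if a == action else 0.0) - p[a]
            self.logits[state][a] += scale * grad

    def imitation_step(self, state, teacher_action, lr=0.5):
        """Supervised update toward the action the human teacher demonstrates."""
        self._nudge(state, teacher_action, lr)

    def reinforce_step(self, episode, reward, lr=0.5):
        """REINFORCE update: weight each taken action by the episode reward."""
        for state, action in episode:
            self._nudge(state, action, lr * reward)


def train(n_teach=20, n_rl=200, seed=0):
    """Assumed schedule: teaching episodes first, then feedback-only episodes."""
    rng = random.Random(seed)
    teacher = {0: 1, 1: 2, 2: 0}  # hypothetical "correct" action per state
    policy = TabularPolicy()
    for ep in range(n_teach + n_rl):
        # The agent acts with its own policy, so it visits its own states.
        episode = [(s, policy.sample(s, rng)) for s in range(N_STATES)]
        if ep < n_teach:
            # Human teaching: correct only the steps where the agent erred.
            for s, a in episode:
                if a != teacher[s]:
                    policy.imitation_step(s, teacher[s])
        else:
            # Human feedback: a single scalar at the end of the episode.
            success = all(a == teacher[s] for s, a in episode)
            policy.reinforce_step(episode, 1.0 if success else -0.1)
    return policy, teacher
```

Calling `policy, teacher = train()` and comparing `policy.probs(s)` against `teacher[s]` for each state shows how far the two signals have shaped the policy. Both updates follow the same log-likelihood gradient; they differ only in where the target action and its weight come from (a demonstrated action versus the agent's own action scaled by a reward).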

Code & Models

Repositories

google-research-datasets/simulated-dialogue
none · Official

Taxonomy

Topics: Speech and dialogue systems · Topic Modeling · Multi-Agent Systems and Negotiation