Adversarial Learning for Neural Dialogue Generation

Jiwei Li; Will Monroe; Tianlin Shi; S\'ebastien Jean; Alan Ritter and; Dan Jurafsky

arXiv:1701.06547·cs.CL·September 26, 2017·221 cites

Adversarial Learning for Neural Dialogue Generation

Jiwei Li, Will Monroe, Tianlin Shi, S\'ebastien Jean, Alan Ritter and, Dan Jurafsky

PDF

Open Access 5 Repos

TL;DR

This paper introduces an adversarial training approach for neural dialogue generation, where a generator and discriminator are trained jointly to produce human-like conversations, improving response quality over previous methods.

Contribution

It proposes a novel adversarial training framework for dialogue generation and an evaluation method based on fooling an adversary, advancing dialogue system development.

Findings

01

Adversarial training yields more human-like responses.

02

The system outperforms previous baselines on multiple metrics.

03

Adversarial evaluation correlates with response quality.

Abstract

In this paper, drawing intuition from the Turing test, we propose using adversarial training for open-domain dialogue generation: the system is trained to produce sequences that are indistinguishable from human-generated dialogue utterances. We cast the task as a reinforcement learning (RL) problem where we jointly train two systems, a generative model to produce response sequences, and a discriminator---analagous to the human evaluator in the Turing test--- to distinguish between the human-generated dialogues and the machine-generated ones. The outputs from the discriminator are then used as rewards for the generative model, pushing the system to generate dialogues that mostly resemble human dialogues. In addition to adversarial training we describe a model for adversarial {\em evaluation} that uses success in fooling an adversary as a dialogue evaluation metric, while avoiding a…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques · Multimodal Machine Learning Applications