Adversarial Learning for Neural Dialogue Generation
Jiwei Li, Will Monroe, Tianlin Shi, S\'ebastien Jean, Alan Ritter and, Dan Jurafsky

TL;DR
This paper introduces an adversarial training approach for neural dialogue generation, where a generator and discriminator are trained jointly to produce human-like conversations, improving response quality over previous methods.
Contribution
It proposes a novel adversarial training framework for dialogue generation and an evaluation method based on fooling an adversary, advancing dialogue system development.
Findings
Adversarial training yields more human-like responses.
The system outperforms previous baselines on multiple metrics.
Adversarial evaluation correlates with response quality.
Abstract
In this paper, drawing intuition from the Turing test, we propose using adversarial training for open-domain dialogue generation: the system is trained to produce sequences that are indistinguishable from human-generated dialogue utterances. We cast the task as a reinforcement learning (RL) problem where we jointly train two systems, a generative model to produce response sequences, and a discriminator---analagous to the human evaluator in the Turing test--- to distinguish between the human-generated dialogues and the machine-generated ones. The outputs from the discriminator are then used as rewards for the generative model, pushing the system to generate dialogues that mostly resemble human dialogues. In addition to adversarial training we describe a model for adversarial {\em evaluation} that uses success in fooling an adversary as a dialogue evaluation metric, while avoiding a…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Natural Language Processing Techniques · Multimodal Machine Learning Applications
