Classification As Decoder: Trading Flexibility For Control In Neural   Dialogue

Sam Shleifer; Manish Chablani; Namit Katariya; Anitha Kannan; Xavier; Amatriain

arXiv:1910.03476·cs.CL·October 21, 2019

Classification As Decoder: Trading Flexibility For Control In Neural Dialogue

Sam Shleifer, Manish Chablani, Namit Katariya, Anitha Kannan, Xavier, Amatriain

PDF

Open Access

TL;DR

This paper proposes a classification-based approach for neural dialogue systems that offers better control over responses by selecting from predefined options, improving response quality and consistency compared to traditional generative models.

Contribution

It introduces a novel classification framework for dialogue response selection, enabling easier editing and better response quality without retraining the model.

Findings

01

Discriminative approach reduces undesirable responses.

02

Response quality surpasses traditional generative models.

03

Model trained without manual response class labels performs equally well.

Abstract

Generative seq2seq dialogue systems are trained to predict the next word in dialogues that have already occurred. They can learn from large unlabeled conversation datasets, build a deep understanding of conversational context, and generate a wide variety of responses. This flexibility comes at the cost of control. Undesirable responses in the training data will be reproduced by the model at inference time, and longer generations often don't make sense. Instead of generating responses one word at a time, we train a classifier to choose from a predefined list of full responses. The classifier is trained on (conversation context, response class) pairs, where each response class is a noisily labeled group of interchangeable responses. At inference, we generate the exemplar response associated with the predicted response class. Experts can edit and improve these exemplar responses over time…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques · Text Readability and Simplification

MethodsSigmoid Activation · Tanh Activation · Long Short-Term Memory · Sequence to Sequence