Classification As Decoder: Trading Flexibility For Control In Neural Dialogue
Sam Shleifer, Manish Chablani, Namit Katariya, Anitha Kannan, Xavier, Amatriain

TL;DR
This paper proposes a classification-based approach for neural dialogue systems that offers better control over responses by selecting from predefined options, improving response quality and consistency compared to traditional generative models.
Contribution
It introduces a novel classification framework for dialogue response selection, enabling easier editing and better response quality without retraining the model.
Findings
Discriminative approach reduces undesirable responses.
Response quality surpasses traditional generative models.
Model trained without manual response class labels performs equally well.
Abstract
Generative seq2seq dialogue systems are trained to predict the next word in dialogues that have already occurred. They can learn from large unlabeled conversation datasets, build a deep understanding of conversational context, and generate a wide variety of responses. This flexibility comes at the cost of control. Undesirable responses in the training data will be reproduced by the model at inference time, and longer generations often don't make sense. Instead of generating responses one word at a time, we train a classifier to choose from a predefined list of full responses. The classifier is trained on (conversation context, response class) pairs, where each response class is a noisily labeled group of interchangeable responses. At inference, we generate the exemplar response associated with the predicted response class. Experts can edit and improve these exemplar responses over time…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Natural Language Processing Techniques · Text Readability and Simplification
MethodsSigmoid Activation · Tanh Activation · Long Short-Term Memory · Sequence to Sequence
