Classification as Decoder: Trading Flexibility for Control in Medical   Dialogue

Sam Shleifer; Manish Chablani; Anitha Kannan; Namit Katariya; Xavier; Amatriain

arXiv:1911.08554·cs.CL·November 21, 2019

Classification as Decoder: Trading Flexibility for Control in Medical Dialogue

Sam Shleifer, Manish Chablani, Anitha Kannan, Namit Katariya, Xavier, Amatriain

PDF

Open Access

TL;DR

This paper introduces a classification-based approach for medical dialogue systems that improves response quality control by mapping conversation context to response classes, balancing flexibility with safety.

Contribution

It proposes a novel method that replaces generative models with a classification head on a pretrained language model for better response control in medical dialogues.

Findings

01

Discriminative model's responses are worse than doctors' 12% of the time.

02

Generative model's responses are worse than doctors' 18% of the time.

03

The approach allows easy updating of responses without retraining the entire system.

Abstract

Generative seq2seq dialogue systems are trained to predict the next word in dialogues that have already occurred. They can learn from large unlabeled conversation datasets, build a deeper understanding of conversational context, and generate a wide variety of responses. This flexibility comes at the cost of control, a concerning tradeoff in doctor/patient interactions. Inaccuracies, typos, or undesirable content in the training data will be reproduced by the model at inference time. We trade a small amount of labeling effort and some loss of response variety in exchange for quality control. More specifically, a pretrained language model encodes the conversational context, and we finetune a classification head to map an encoded conversational context to a response class, where each class is a noisily labeled group of interchangeable responses. Experts can update these exemplar responses…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Speech and dialogue systems · Natural Language Processing Techniques

MethodsSigmoid Activation · Tanh Activation · Long Short-Term Memory · Sequence to Sequence