Classification as Decoder: Trading Flexibility for Control in Medical Dialogue
Sam Shleifer, Manish Chablani, Anitha Kannan, Namit Katariya, Xavier, Amatriain

TL;DR
This paper introduces a classification-based approach for medical dialogue systems that improves response quality control by mapping conversation context to response classes, balancing flexibility with safety.
Contribution
It proposes a novel method that replaces generative models with a classification head on a pretrained language model for better response control in medical dialogues.
Findings
Discriminative model's responses are worse than doctors' 12% of the time.
Generative model's responses are worse than doctors' 18% of the time.
The approach allows easy updating of responses without retraining the entire system.
Abstract
Generative seq2seq dialogue systems are trained to predict the next word in dialogues that have already occurred. They can learn from large unlabeled conversation datasets, build a deeper understanding of conversational context, and generate a wide variety of responses. This flexibility comes at the cost of control, a concerning tradeoff in doctor/patient interactions. Inaccuracies, typos, or undesirable content in the training data will be reproduced by the model at inference time. We trade a small amount of labeling effort and some loss of response variety in exchange for quality control. More specifically, a pretrained language model encodes the conversational context, and we finetune a classification head to map an encoded conversational context to a response class, where each class is a noisily labeled group of interchangeable responses. Experts can update these exemplar responses…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Speech and dialogue systems · Natural Language Processing Techniques
MethodsSigmoid Activation · Tanh Activation · Long Short-Term Memory · Sequence to Sequence
