SMRT Chatbots: Improving Non-Task-Oriented Dialog with Simulated   Multiple Reference Training

Huda Khayrallah; Jo\~ao Sedoc

arXiv:2011.00547·cs.CL·April 23, 2021

SMRT Chatbots: Improving Non-Task-Oriented Dialog with Simulated Multiple Reference Training

Huda Khayrallah, Jo\~ao Sedoc

PDF

TL;DR

This paper introduces SMRT, a training method that uses simulated multiple responses to enhance non-task-oriented chatbots, resulting in improved response quality and diversity without needing additional domain-specific data.

Contribution

The paper presents SMRT, a novel training approach that leverages paraphrasing to simulate multiple responses, significantly improving chatbot response quality and diversity.

Findings

01

SMRT outperforms a strong Transformer baseline in quality and diversity.

02

SMRT is comparable to pretraining in human evaluations.

03

SMRT surpasses pretraining on automatic quality and lexical diversity.

Abstract

Non-task-oriented dialog models suffer from poor quality and non-diverse responses. To overcome limited conversational data, we apply Simulated Multiple Reference Training (SMRT; Khayrallah et al., 2020), and use a paraphraser to simulate multiple responses per training prompt. We find SMRT improves over a strong Transformer baseline as measured by human and automatic quality scores and lexical diversity. We also find SMRT is comparable to pretraining in human evaluation quality, and outperforms pretraining on automatic quality and lexical diversity, without requiring related-domain dialog data.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

MethodsLinear Layer · Absolute Position Encodings · Position-Wise Feed-Forward Layer · Adam · Residual Connection · Dropout · Multi-Head Attention · Byte Pair Encoding · Softmax · Dense Connections