Diversifying Neural Dialogue Generation via Negative Distillation

Yiwei Li; Shaoxiong Feng; Bin Sun; Kan Li

arXiv:2205.02795·cs.CL·May 6, 2022

Diversifying Neural Dialogue Generation via Negative Distillation

Yiwei Li, Shaoxiong Feng, Bin Sun, Kan Li

PDF

Open Access

TL;DR

This paper introduces negative distillation, a novel training method for neural dialogue models that effectively reduces generic responses by using a negative teacher model and multi-level negative knowledge, improving response diversity.

Contribution

The paper proposes negative distillation, a new negative training paradigm utilizing a negative teacher model and multi-level negative knowledge to enhance dialogue response diversity.

Findings

01

Outperforms previous negative training methods significantly

02

Effectively reduces generic responses in dialogue generation

03

Improves diversity of generated responses

Abstract

Generative dialogue models suffer badly from the generic response problem, limiting their applications to a few toy scenarios. Recently, an interesting approach, namely negative training, has been proposed to alleviate this problem by reminding the model not to generate high-frequency responses during training. However, its performance is hindered by two issues, ignoring low-frequency but generic responses and bringing low-frequency but meaningless responses. In this paper, we propose a novel negative training paradigm, called negative distillation, to keep the model away from the undesirable generic responses while avoiding the above problems. First, we introduce a negative teacher model that can produce query-wise generic responses, and then the student model is required to maximize the distance with multi-level negative knowledge. Empirical results show that our method outperforms…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Speech and dialogue systems · Multimodal Machine Learning Applications