Multi-Referenced Training for Dialogue Response Generation

Tianyu Zhao; Tatsuya Kawahara

arXiv:2009.07117·cs.CL·October 20, 2020

Multi-Referenced Training for Dialogue Response Generation

Tianyu Zhao, Tatsuya Kawahara

PDF

1 Repo

TL;DR

This paper introduces multi-referenced training methods for dialogue response generation, utilizing diverse pseudo references and an expressive variational prior to better model one-to-many responses, leading to significant improvements.

Contribution

It proposes a novel multi-referenced training approach combining data augmentation with pseudo references and an expressive variational prior, enhancing dialogue response diversity.

Findings

01

Significant improvements in automated and human evaluations.

02

Effective use of pseudo references from pretrained models.

03

Enhanced modeling of one-to-many dialogue responses.

Abstract

In open-domain dialogue response generation, a dialogue context can be continued with diverse responses, and the dialogue models should capture such one-to-many relations. In this work, we first analyze the training objective of dialogue models from the view of Kullback-Leibler divergence (KLD) and show that the gap between the real world probability distribution and the single-referenced data's probability distribution prevents the model from learning the one-to-many relations efficiently. Then we explore approaches to multi-referenced training in two aspects. Data-wise, we generate diverse pseudo references from a powerful pretrained model to build multi-referenced data that provides a better approximation of the real-world distribution. Model-wise, we propose to equip variational models with an expressive prior, named linear Gaussian model (LGM). Experimental results of automated…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

ZHAOTING/dialog-processing
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.