DialogVED: A Pre-trained Latent Variable Encoder-Decoder Model for   Dialog Response Generation

Wei Chen; Yeyun Gong; Song Wang; Bolun Yao; Weizhen Qi; Zhongyu Wei,; Xiaowu Hu; Bartuer Zhou; Yi Mao; Weizhu Chen; Biao Cheng; Nan Duan

arXiv:2204.13031·cs.CL·November 1, 2022

DialogVED: A Pre-trained Latent Variable Encoder-Decoder Model for Dialog Response Generation

Wei Chen, Yeyun Gong, Song Wang, Bolun Yao, Weizhen Qi, Zhongyu Wei,, Xiaowu Hu, Bartuer Zhou, Yi Mao, Weizhu Chen, Biao Cheng, Nan Duan

PDF

1 Repo

TL;DR

DialogVED is a novel pre-trained model that incorporates continuous latent variables and turn structure modeling to generate more relevant and diverse open-domain dialog responses, achieving state-of-the-art results.

Contribution

Introduces DialogVED, a pre-trained encoder-decoder with latent variables and turn structure modeling for improved dialog response generation.

Findings

01

Achieves state-of-the-art results on PersonaChat, DailyDialog, and DSTC7-AVSD.

02

Enhances response relevance and diversity through latent variables.

03

Utilizes a multi-task pre-training framework with four tasks.

Abstract

Dialog response generation in open domain is an important research topic where the main challenge is to generate relevant and diverse responses. In this paper, we propose a new dialog pre-training framework called DialogVED, which introduces continuous latent variables into the enhanced encoder-decoder pre-training framework to increase the relevance and diversity of responses. With the help of a large dialog corpus (Reddit), we pre-train the model using the following 4 tasks adopted in language models (LMs) and variational autoencoders (VAEs): 1) masked language model; 2) response generation; 3) bag-of-words prediction; and 4) KL divergence reduction. We also add additional parameters to model the turn structure in dialogs to improve the performance of the pre-trained model. We conduct experiments on PersonaChat, DailyDialog, and DSTC7-AVSD benchmarks for response generation.…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

lemuria-wchen/DialogVED
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.