Transformer-Based Conditioned Variational Autoencoder for Dialogue   Generation

Huihui Yang

arXiv:2210.12326·cs.CL·October 25, 2022

Transformer-Based Conditioned Variational Autoencoder for Dialogue Generation

Huihui Yang

PDF

Open Access

TL;DR

This paper introduces CVAE-T, a Transformer-based conditioned variational autoencoder for dialogue generation, which improves response informativeness by explicitly modeling semantic differences using negative examples.

Contribution

The paper proposes a novel Transformer-based CVAE model that incorporates negative examples and regularization to generate more diverse and informative dialogue responses.

Findings

01

Produces more informative replies

02

Effectively models semantic differences

03

Outperforms baseline models in diversity

Abstract

In human dialogue, a single query may elicit numerous appropriate responses. The Transformer-based dialogue model produces frequently occurring sentences in the corpus since it is a one-to-one mapping function. CVAE is a technique for reducing generic replies. In this paper, we create a new dialogue model (CVAE-T) based on the Transformer with CVAE structure. We use a pre-trained MLM model to rewrite some key n-grams in responses to obtain a series of negative examples, and introduce a regularization term during training to explicitly guide the latent variable in learning the semantic differences between each pair of positive and negative examples. Experiments suggest that the method we design is capable of producing more informative replies.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques · Speech and dialogue systems

MethodsMulti-Head Attention · Attention Is All You Need · Linear Layer · Dense Connections · Softmax · Position-Wise Feed-Forward Layer · Adam · Label Smoothing · Absolute Position Encodings · Layer Normalization