Transformer-Based Conditioned Variational Autoencoder for Dialogue Generation
Huihui Yang

TL;DR
This paper introduces CVAE-T, a Transformer-based conditioned variational autoencoder for dialogue generation, which improves response informativeness by explicitly modeling semantic differences using negative examples.
Contribution
The paper proposes a novel Transformer-based CVAE model that incorporates negative examples and regularization to generate more diverse and informative dialogue responses.
Findings
Produces more informative replies
Effectively models semantic differences
Outperforms baseline models in diversity
Abstract
In human dialogue, a single query may elicit numerous appropriate responses. The Transformer-based dialogue model produces frequently occurring sentences in the corpus since it is a one-to-one mapping function. CVAE is a technique for reducing generic replies. In this paper, we create a new dialogue model (CVAE-T) based on the Transformer with CVAE structure. We use a pre-trained MLM model to rewrite some key n-grams in responses to obtain a series of negative examples, and introduce a regularization term during training to explicitly guide the latent variable in learning the semantic differences between each pair of positive and negative examples. Experiments suggest that the method we design is capable of producing more informative replies.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Natural Language Processing Techniques · Speech and dialogue systems
MethodsMulti-Head Attention · Attention Is All You Need · Linear Layer · Dense Connections · Softmax · Position-Wise Feed-Forward Layer · Adam · Label Smoothing · Absolute Position Encodings · Layer Normalization
