Variational Transformers for Diverse Response Generation
Zhaojiang Lin, Genta Indra Winata, Peng Xu, Zihan Liu, Pascale Fung

TL;DR
This paper introduces Variational Transformers, a novel sequence model combining Transformer architecture with variational latent variables to enhance diversity and relevance in dialogue response generation.
Contribution
It proposes a new Variational Transformer model that incorporates stochastic latent variables into Transformers for more diverse and contextually relevant dialogue responses.
Findings
Models outperform baselines in response diversity.
Enhanced semantic relevance in generated responses.
Improved human judgment scores.
Abstract
Despite the great promise of Transformers in many sequence modeling tasks (e.g., machine translation), their deterministic nature hinders them from generalizing to high entropy tasks such as dialogue response generation. Previous work proposes to capture the variability of dialogue responses with a recurrent neural network (RNN)-based conditional variational autoencoder (CVAE). However, the autoregressive computation of the RNN limits the training efficiency. Therefore, we propose the Variational Transformer (VT), a variational self-attentive feed-forward sequence model. The VT combines the parallelizability and global receptive field of the Transformer with the variational nature of the CVAE by incorporating stochastic latent variables into Transformers. We explore two types of the VT: 1) modeling the discourse-level diversity with a global latent variable; and 2) augmenting the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSpeech Recognition and Synthesis · Speech and Audio Processing · Music and Audio Processing
MethodsLinear Layer · Absolute Position Encodings · Position-Wise Feed-Forward Layer · Conditional Variational Auto Encoder · Solana Customer Service Number +1-833-534-1729 · Residual Connection · Byte Pair Encoding · Dense Connections · Label Smoothing · *Communicated@Fast*How Do I Communicate to Expedia?
