DialogWAE: Multimodal Response Generation with Conditional Wasserstein Auto-Encoder
Xiaodong Gu, Kyunghyun Cho, Jung-Woo Ha, Sunghun Kim

TL;DR
DialogWAE introduces a novel dialogue response generation model that leverages a conditional Wasserstein autoencoder with a GAN-based latent space to produce more coherent, diverse, and informative responses than previous methods.
Contribution
The paper proposes DialogWAE, a new conditional Wasserstein autoencoder for dialogue modeling that uses a GAN in the latent space and a Gaussian mixture prior to enhance response diversity and quality.
Findings
Outperforms state-of-the-art models in response coherence and diversity
Enables richer, more informative dialogue responses
Utilizes GAN-based latent space for improved data distribution modeling
Abstract
Variational autoencoders~(VAEs) have shown a promise in data-driven conversation modeling. However, most VAE conversation models match the approximate posterior distribution over the latent variables to a simple prior such as standard normal distribution, thereby restricting the generated responses to a relatively simple (e.g., unimodal) scope. In this paper, we propose DialogWAE, a conditional Wasserstein autoencoder~(WAE) specially designed for dialogue modeling. Unlike VAEs that impose a simple distribution over the latent variables, DialogWAE models the distribution of data by training a GAN within the latent variable space. Specifically, our model samples from the prior and posterior distributions over the latent variables by transforming context-dependent random noise using neural networks and minimizes the Wasserstein distance between the two distributions. We further develop a…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Multimodal Machine Learning Applications · Natural Language Processing Techniques
MethodsConvolution · USD Coin Customer Service Number +1-833-534-1729 · Dogecoin Customer Service Number +1-833-534-1729
