Implicit Deep Latent Variable Models for Text Generation
Le Fang, Chunyuan Li, Jianfeng Gao, Wen Dong, Changyou Chen

TL;DR
This paper introduces implicit deep latent variable models for text generation, overcoming Gaussian limitations and posterior collapse issues in VAEs by using sample-based representations and mutual information regularization, enhancing flexibility and performance.
Contribution
It proposes a novel implicit latent variable model with mutual information regularization, improving text generation quality and addressing limitations of traditional VAEs.
Findings
Effective in language modeling
Improves style transfer quality
Enhances dialog response generation
Abstract
Deep latent variable models (LVM) such as variational auto-encoder (VAE) have recently played an important role in text generation. One key factor is the exploitation of smooth latent structures to guide the generation. However, the representation power of VAEs is limited due to two reasons: (1) the Gaussian assumption is often made on the variational posteriors; and meanwhile (2) a notorious "posterior collapse" issue occurs. In this paper, we advocate sample-based representations of variational distributions for natural language, leading to implicit latent features, which can provide flexible representation power compared with Gaussian-based posteriors. We further develop an LVM to directly match the aggregated posterior to the prior. It can be viewed as a natural extension of VAEs with a regularization of maximizing mutual information, mitigating the "posterior collapse" issue. We…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Natural Language Processing Techniques · Speech Recognition and Synthesis
