Implicit Deep Latent Variable Models for Text Generation

Le Fang; Chunyuan Li; Jianfeng Gao; Wen Dong; Changyou Chen

arXiv:1908.11527·cs.LG·December 2, 2019·5 cites

Implicit Deep Latent Variable Models for Text Generation

Le Fang, Chunyuan Li, Jianfeng Gao, Wen Dong, Changyou Chen

PDF

Open Access 1 Repo

TL;DR

This paper introduces implicit deep latent variable models for text generation, overcoming Gaussian limitations and posterior collapse issues in VAEs by using sample-based representations and mutual information regularization, enhancing flexibility and performance.

Contribution

It proposes a novel implicit latent variable model with mutual information regularization, improving text generation quality and addressing limitations of traditional VAEs.

Findings

01

Effective in language modeling

02

Improves style transfer quality

03

Enhances dialog response generation

Abstract

Deep latent variable models (LVM) such as variational auto-encoder (VAE) have recently played an important role in text generation. One key factor is the exploitation of smooth latent structures to guide the generation. However, the representation power of VAEs is limited due to two reasons: (1) the Gaussian assumption is often made on the variational posteriors; and meanwhile (2) a notorious "posterior collapse" issue occurs. In this paper, we advocate sample-based representations of variational distributions for natural language, leading to implicit latent features, which can provide flexible representation power compared with Gaussian-based posteriors. We further develop an LVM to directly match the aggregated posterior to the prior. It can be viewed as a natural extension of VAEs with a regularization of maximizing mutual information, mitigating the "posterior collapse" issue. We…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

fangleai/Implicit-LVM
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques · Speech Recognition and Synthesis