Improve Variational Autoencoder for Text Generationwith Discrete Latent   Bottleneck

Yang Zhao; Ping Yu; Suchismit Mahapatra; Qinliang Su; Changyou Chen

arXiv:2004.10603·cs.LG·February 26, 2021·1 cites

Improve Variational Autoencoder for Text Generationwith Discrete Latent Bottleneck

Yang Zhao, Ping Yu, Suchismit Mahapatra, Qinliang Su, Changyou Chen

PDF

Open Access

TL;DR

This paper introduces a discretized bottleneck in variational autoencoders to improve text generation by encouraging meaningful latent representations, leading to better interpretability and performance across various NLP tasks.

Contribution

The paper proposes a novel discretized latent bottleneck in VAEs that enhances semantic modeling and interpretability in text generation tasks.

Findings

01

Improved text generation quality across multiple NLP tasks.

02

Enhanced interpretability of latent structures.

03

Demonstrated efficiency and effectiveness empirically.

Abstract

Variational autoencoders (VAEs) are essential tools in end-to-end representation learning. However, the sequential text generation common pitfall with VAEs is that the model tends to ignore latent variables with a strong auto-regressive decoder. In this paper, we propose a principled approach to alleviate this issue by applying a discretized bottleneck to enforce an implicit latent feature matching in a more compact latent space. We impose a shared discrete latent space where each input is learned to choose a combination of latent atoms as a regularized latent representation. Our model endows a promising capability to model underlying semantics of discrete sequences and thus provide more interpretative latent structures. Empirically, we demonstrate our model's efficiency and effectiveness on a broad range of tasks, including language modeling, unaligned text style transfer, dialog…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques · Multimodal Machine Learning Applications