Disentangling Generative Factors in Natural Language with Discrete   Variational Autoencoders

Giangiacomo Mercatali; Andr\'e Freitas

arXiv:2109.07169·cs.CL·September 16, 2021

Disentangling Generative Factors in Natural Language with Discrete Variational Autoencoders

Giangiacomo Mercatali, Andr\'e Freitas

PDF

Open Access

TL;DR

This paper introduces a discrete variational autoencoder approach for learning disentangled, interpretable representations in NLP by modeling linguistic features as discrete variables, outperforming existing methods.

Contribution

It proposes a novel discrete VAE model tailored for textual data, addressing limitations of continuous models in capturing discrete linguistic features.

Findings

01

Outperforms continuous and discrete baselines on disentanglement benchmarks

02

Achieves better interpretability of linguistic features

03

Improves text style transfer performance

Abstract

The ability of learning disentangled representations represents a major step for interpretable NLP systems as it allows latent linguistic features to be controlled. Most approaches to disentanglement rely on continuous variables, both for images and text. We argue that despite being suitable for image datasets, continuous variables may not be ideal to model features of textual data, due to the fact that most generative factors in text are discrete. We propose a Variational Autoencoder based method which models language features as discrete variables and encourages independence between variables for learning disentangled representations. The proposed model outperforms continuous and discrete baselines on several qualitative and quantitative benchmarks for disentanglement as well as on a text style transfer downstream application.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsGenerative Adversarial Networks and Image Synthesis · Digital Media Forensic Detection · Topic Modeling