Educating Text Autoencoders: Latent Representation Guidance via   Denoising

Tianxiao Shen; Jonas Mueller; Regina Barzilay; Tommi Jaakkola

arXiv:1905.12777·cs.LG·July 8, 2020·31 cites

Educating Text Autoencoders: Latent Representation Guidance via Denoising

Tianxiao Shen, Jonas Mueller, Regina Barzilay, Tommi Jaakkola

PDF

Open Access 3 Repos 1 Video

TL;DR

This paper introduces a denoising objective for autoencoders to improve the structure of latent spaces, enabling more coherent text manipulation and zero-shot style transfer.

Contribution

It proposes a denoising adversarial autoencoder that guides latent space geometry, enhancing controllable text generation and style transfer capabilities.

Findings

01

Improved latent space structure for text autoencoders

02

Enhanced zero-shot style transfer via latent vector arithmetic

03

Better trade-off between generation quality and reconstruction

Abstract

Generative autoencoders offer a promising approach for controllable text generation by leveraging their latent sentence representations. However, current models struggle to maintain coherent latent spaces required to perform meaningful text manipulations via latent vector operations. Specifically, we demonstrate by example that neural encoders do not necessarily map similar sentences to nearby latent vectors. A theoretical explanation for this phenomenon establishes that high capacity autoencoders can learn an arbitrary mapping between sequences and associated latent representations. To remedy this issue, we augment adversarial autoencoders with a denoising objective where original sentences are reconstructed from perturbed versions (referred to as DAAE). We prove that this simple modification guides the latent space geometry of the resulting model by encouraging the encoder to map…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Videos

Educating Text Autoencoders: Latent Representation Guidance via Denoising· slideslive

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques · Generative Adversarial Networks and Image Synthesis