Piecewise Latent Variables for Neural Variational Text Processing

Iulian V. Serban; Alexander G. Ororbia II; Joelle Pineau; Aaron; Courville

arXiv:1612.00377·cs.CL·September 26, 2017

Piecewise Latent Variables for Neural Variational Text Processing

Iulian V. Serban, Alexander G. Ororbia II, Joelle Pineau, Aaron, Courville

PDF

Open Access 2 Repos

TL;DR

This paper introduces a flexible piecewise constant distribution for neural variational models, enabling better representation of complex, multi-modal latent factors in natural language processing tasks, leading to improved performance.

Contribution

It proposes a novel, highly flexible piecewise constant distribution for latent variables, overcoming limitations of traditional Gaussian priors in neural variational models.

Findings

01

Enhanced document modeling performance

02

Improved natural language generation quality

03

Representation of complex latent factors

Abstract

Advances in neural variational inference have facilitated the learning of powerful directed graphical models with continuous latent variables, such as variational autoencoders. The hope is that such models will learn to represent rich, multi-modal latent factors in real-world data, such as natural language text. However, current models often assume simplistic priors on the latent variables - such as the uni-modal Gaussian distribution - which are incapable of representing complex latent factors efficiently. To overcome this restriction, we propose the simple, but highly flexible, piecewise constant distribution. This distribution has the capacity to represent an exponential number of modes of a latent target distribution, while remaining mathematically tractable. Our results demonstrate that incorporating this new latent distribution into different models yields substantial improvements…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques · Speech Recognition and Synthesis