The Neglected Sibling: Isotropic Gaussian Posterior for VAE

Lan Zhang; Wray Buntine; Ehsan Shareghi

arXiv:2110.07383·cs.LG·October 15, 2021

TL;DR

This paper proposes an Isotropic Gaussian Posterior (IGP) for VAEs that makes better use of the latent space and improves downstream task performance, sample efficiency, and robustness. Theoretical and empirical analysis supports the gains on NLP tasks, and they carry over to the image domain.

Contribution

The paper proposes a simple modification to VAEs, an Isotropic Gaussian Posterior, that avoids inactive dimensions in the latent space. The change is supported by theoretical analysis and by experiments on several datasets and tasks.

Findings

01 Improved downstream task performance
02 Enhanced sample efficiency and robustness
03 Generalization to image domain

Abstract

Deep generative models have been widely used in several areas of NLP, and various techniques have been proposed to augment them or address their training challenges. In this paper, we propose a simple modification to Variational Autoencoders (VAEs) by using an Isotropic Gaussian Posterior (IGP) that allows for better utilisation of their latent representation space. This model avoids the sub-optimal behavior of VAEs related to inactive dimensions in the representation space. We provide both theoretical analysis and empirical evidence on various datasets and tasks showing that IGP leads to consistent improvement on several quantitative and qualitative grounds, from downstream task performance and sample efficiency to robustness. Additionally, we give insights about the representational properties encouraged by IGP and show that its gain generalises to the image domain as well.
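The abstract does not spell out the parameterisation, so the following is only a minimal sketch under stated assumptions: the isotropic posterior is taken to be N(mu, sigma^2 I) with one shared scalar variance per input, the prior is the standard normal N(0, I), and the comparison point is the common diagonal-covariance posterior. The function names are illustrative and are not taken from the IGPVAE repository.

```python
import math
import random


def kl_diagonal(mu, log_var):
    """KL( N(mu, diag(exp(log_var))) || N(0, I) ) for one latent vector.

    mu and log_var are equal-length sequences: one variance per dimension.
    """
    return 0.5 * sum(
        math.exp(lv) + m * m - 1.0 - lv for m, lv in zip(mu, log_var)
    )


def kl_isotropic(mu, log_var):
    """KL( N(mu, exp(log_var) * I) || N(0, I) ) for one latent vector.

    Assumption: log_var is a single scalar shared by every dimension.
    """
    d = len(mu)
    sq_norm = sum(m * m for m in mu)
    return 0.5 * (d * math.exp(log_var) + sq_norm - d - d * log_var)


def sample_isotropic(mu, log_var, rng):
    """Reparameterised draw z = mu + sigma * eps with eps ~ N(0, I)."""
    sigma = math.exp(0.5 * log_var)
    return [m + sigma * rng.gauss(0.0, 1.0) for m in mu]


if __name__ == "__main__":
    mu = [0.5, -1.0, 2.0]
    shared_log_var = -0.3
    rng = random.Random(0)
    print("KL (isotropic):", kl_isotropic(mu, shared_log_var))
    print("KL (diagonal, tied):", kl_diagonal(mu, [shared_log_var] * len(mu)))
    print("sample:", sample_isotropic(mu, shared_log_var, rng))
```

With a single shared variance, the isotropic KL term equals the diagonal one evaluated with all per-dimension variances tied to the same value. In this sketch the difference from a diagonal posterior lies only in what the encoder is allowed to output.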

Code & Models

Repositories

lanzhang128/IGPVAE (TensorFlow, official)

Taxonomy

Topics: Topic Modeling · Speech Recognition and Synthesis · Energy Load and Power Forecasting