The Neglected Sibling: Isotropic Gaussian Posterior for VAE
Lan Zhang, Wray Buntine, Ehsan Shareghi

TL;DR
This paper introduces an Isotropic Gaussian Posterior for VAEs, improving latent space utilization, robustness, and sample efficiency, with demonstrated benefits across NLP and image tasks through theoretical and empirical analysis.
Contribution
The paper proposes a simple modification to VAEs using an Isotropic Gaussian Posterior, enhancing latent space usage and performance, which is validated through extensive experiments.
Findings
Improved downstream task performance
Enhanced sample efficiency and robustness
Generalization to image domain
Abstract
Deep generative models have been widely used in several areas of NLP, and various techniques have been proposed to augment them or address their training challenges. In this paper, we propose a simple modification to Variational Autoencoders (VAEs) by using an Isotropic Gaussian Posterior (IGP) that allows for better utilisation of their latent representation space. This model avoids the sub-optimal behavior of VAEs related to inactive dimensions in the representation space. We provide both theoretical analysis, and empirical evidence on various datasets and tasks that show IGP leads to consistent improvement on several quantitative and qualitative grounds, from downstream task performance and sample efficiency to robustness. Additionally, we give insights about the representational properties encouraged by IGP and also show that its gain generalises to image domain as well.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Speech Recognition and Synthesis · Energy Load and Power Forecasting
