Posterior Collapse and Latent Variable Non-identifiability
Yixin Wang, David M. Blei, John P. Cunningham

TL;DR
This paper links posterior collapse in variational autoencoders to latent variable non-identifiability, proving the phenomenon can occur even with exact inference, and proposes a new class of identifiable models that improve data representation.
Contribution
It establishes a theoretical connection between posterior collapse and non-identifiability, and introduces latent-identifiable variational autoencoders using bijective neural maps.
Findings
Latent-identifiable VAEs outperform existing methods in mitigating posterior collapse.
Posterior collapse can occur even with exact inference in classical models.
The proposed models produce more meaningful data representations.
Abstract
Variational autoencoders model high-dimensional data by positing low-dimensional latent variables that are mapped through a flexible distribution parametrized by a neural network. Unfortunately, variational autoencoders often suffer from posterior collapse: the posterior of the latent variables is equal to its prior, rendering the variational autoencoder useless as a means to produce meaningful representations. Existing approaches to posterior collapse often attribute it to the use of neural networks or optimization issues due to variational approximation. In this paper, we consider posterior collapse as a problem of latent variable non-identifiability. We prove that the posterior collapses if and only if the latent variables are non-identifiable in the generative model. This fact implies that posterior collapse is not a phenomenon specific to the use of flexible distributions or…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Taxonomy
TopicsExplainable Artificial Intelligence (XAI) · Statistical Methods and Inference
MethodsVariational Inference
