Posterior Collapse and Latent Variable Non-identifiability

Yixin Wang; David M. Blei; John P. Cunningham

arXiv:2301.00537·stat.ML·January 3, 2023·26 cites

Posterior Collapse and Latent Variable Non-identifiability

Yixin Wang, David M. Blei, John P. Cunningham

PDF

Open Access 1 Video

TL;DR

This paper links posterior collapse in variational autoencoders to latent variable non-identifiability, proving the phenomenon can occur even with exact inference, and proposes a new class of identifiable models that improve data representation.

Contribution

It establishes a theoretical connection between posterior collapse and non-identifiability, and introduces latent-identifiable variational autoencoders using bijective neural maps.

Findings

01

Latent-identifiable VAEs outperform existing methods in mitigating posterior collapse.

02

Posterior collapse can occur even with exact inference in classical models.

03

The proposed models produce more meaningful data representations.

Abstract

Variational autoencoders model high-dimensional data by positing low-dimensional latent variables that are mapped through a flexible distribution parametrized by a neural network. Unfortunately, variational autoencoders often suffer from posterior collapse: the posterior of the latent variables is equal to its prior, rendering the variational autoencoder useless as a means to produce meaningful representations. Existing approaches to posterior collapse often attribute it to the use of neural networks or optimization issues due to variational approximation. In this paper, we consider posterior collapse as a problem of latent variable non-identifiability. We prove that the posterior collapses if and only if the latent variables are non-identifiable in the generative model. This fact implies that posterior collapse is not a phenomenon specific to the use of flexible distributions or…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

Posterior Collapse and Latent Variable Non-identifiability· slideslive

Taxonomy

TopicsExplainable Artificial Intelligence (XAI) · Statistical Methods and Inference

MethodsVariational Inference