Embrace the Gap: VAEs Perform Independent Mechanism Analysis

Patrik Reizinger; Luigi Gresele; Jack Brady; Julius von K\"ugelgen,; Dominik Zietlow; Bernhard Sch\"olkopf; Georg Martius; Wieland Brendel; Michel; Besserve

arXiv:2206.02416·stat.ML·January 30, 2023·5 cites

Embrace the Gap: VAEs Perform Independent Mechanism Analysis

Patrik Reizinger, Luigi Gresele, Jack Brady, Julius von K\"ugelgen,, Dominik Zietlow, Bernhard Sch\"olkopf, Georg Martius, Wieland Brendel, Michel, Besserve

PDF

Open Access 1 Repo 1 Video

TL;DR

This paper explains why variational autoencoders (VAEs) effectively learn representations by demonstrating their connection to independent mechanism analysis (IMA) and showing they can recover true latent factors under certain conditions.

Contribution

It proves that in the near-deterministic decoder regime, VAEs' optimal encoder approximately inverts the decoder, linking ELBO maximization to IMA and improving understanding of their success.

Findings

01

VAEs perform independent mechanism analysis (IMA) under certain conditions.

02

The ELBO converges to a regularized log-likelihood, aiding representation learning.

03

VAEs recover true latent factors in synthetic and image data when IMA assumptions hold.

Abstract

Variational autoencoders (VAEs) are a popular framework for modeling complex data distributions; they can be efficiently trained via variational inference by maximizing the evidence lower bound (ELBO), at the expense of a gap to the exact (log-)marginal likelihood. While VAEs are commonly used for representation learning, it is unclear why ELBO maximization would yield useful representations, since unregularized maximum likelihood estimation cannot invert the data-generating process. Yet, VAEs often succeed at this task. We seek to elucidate this apparent paradox by studying nonlinear VAEs in the limit of near-deterministic decoders. We first prove that, in this regime, the optimal encoder approximately inverts the decoder -- a commonly used but unproven conjecture -- which we refer to as {\em self-consistency}. Leveraging self-consistency, we show that the ELBO converges to a…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

rpatrik96/ima-vae
jaxOfficial

Videos

Embrace the Gap: VAEs Perform Independent Mechanism Analysis· slideslive

Taxonomy

TopicsModel Reduction and Neural Networks · Domain Adaptation and Few-Shot Learning · Protein Structure and Dynamics

MethodsVariational Inference