Embrace the Gap: VAEs Perform Independent Mechanism Analysis
Patrik Reizinger, Luigi Gresele, Jack Brady, Julius von K\"ugelgen,, Dominik Zietlow, Bernhard Sch\"olkopf, Georg Martius, Wieland Brendel, Michel, Besserve

TL;DR
This paper explains why variational autoencoders (VAEs) effectively learn representations by demonstrating their connection to independent mechanism analysis (IMA) and showing they can recover true latent factors under certain conditions.
Contribution
It proves that in the near-deterministic decoder regime, VAEs' optimal encoder approximately inverts the decoder, linking ELBO maximization to IMA and improving understanding of their success.
Findings
VAEs perform independent mechanism analysis (IMA) under certain conditions.
The ELBO converges to a regularized log-likelihood, aiding representation learning.
VAEs recover true latent factors in synthetic and image data when IMA assumptions hold.
Abstract
Variational autoencoders (VAEs) are a popular framework for modeling complex data distributions; they can be efficiently trained via variational inference by maximizing the evidence lower bound (ELBO), at the expense of a gap to the exact (log-)marginal likelihood. While VAEs are commonly used for representation learning, it is unclear why ELBO maximization would yield useful representations, since unregularized maximum likelihood estimation cannot invert the data-generating process. Yet, VAEs often succeed at this task. We seek to elucidate this apparent paradox by studying nonlinear VAEs in the limit of near-deterministic decoders. We first prove that, in this regime, the optimal encoder approximately inverts the decoder -- a commonly used but unproven conjecture -- which we refer to as {\em self-consistency}. Leveraging self-consistency, we show that the ELBO converges to a…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Taxonomy
TopicsModel Reduction and Neural Networks · Domain Adaptation and Few-Shot Learning · Protein Structure and Dynamics
MethodsVariational Inference
