Understanding Variational Autoencoders with Intrinsic Dimension and   Information Imbalance

Charles Camboulin; Diego Doimo; Aldo Glielmo

arXiv:2411.01978·cs.LG·November 5, 2024

Understanding Variational Autoencoders with Intrinsic Dimension and Information Imbalance

Charles Camboulin, Diego Doimo, Aldo Glielmo

PDF

Open Access 1 Repo

TL;DR

This paper analyzes how Variational Autoencoders process information by examining intrinsic dimension and information imbalance, revealing phase transitions and providing insights into architecture design and model behavior.

Contribution

It introduces the use of Intrinsic Dimension and Information Imbalance to analyze VAE representations, uncovering phase transitions and training dynamics.

Findings

01

VAEs exhibit a double hunchback ID profile when bottleneck exceeds data ID

02

Two training phases identified: rapid fitting and slower generalization

03

II and ID are useful for architecture search and diagnosing underfitting

Abstract

This work presents an analysis of the hidden representations of Variational Autoencoders (VAEs) using the Intrinsic Dimension (ID) and the Information Imbalance (II). We show that VAEs undergo a transition in behaviour once the bottleneck size is larger than the ID of the data, manifesting in a double hunchback ID profile and a qualitative shift in information processing as captured by the II. Our results also highlight two distinct training phases for architectures with sufficiently large bottleneck sizes, consisting of a rapid fit and a slower generalisation, as assessed by a differentiated behaviour of ID, II, and KL loss. These insights demonstrate that II and ID could be valuable tools for aiding architecture search, for diagnosing underfitting in VAEs, and, more broadly, they contribute to advancing a unified understanding of deep generative models through geometric analysis.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

bancaditalia/understanding-variational-autoencoders-with-intrinsic-dimension-and-information-imbalance
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsGenerative Adversarial Networks and Image Synthesis