Overlooked Implications of the Reconstruction Loss for VAE   Disentanglement

Nathan Michlo; Richard Klein; Steven James

arXiv:2202.13341·cs.LG·October 27, 2023·1 cites

Overlooked Implications of the Reconstruction Loss for VAE Disentanglement

Nathan Michlo, Richard Klein, Steven James

PDF

Open Access 1 Repo

TL;DR

This paper reveals that the interaction between data and the reconstruction loss in VAEs significantly influences disentanglement, challenging the common focus on regularisation and highlighting the subjective nature of disentanglement outcomes.

Contribution

The work uncovers the overlooked role of the reconstruction loss in VAE disentanglement and introduces a theory for adversarial datasets that hinder disentanglement.

Findings

01

Standard datasets have unintended correlations affecting disentanglement.

02

Constructed datasets can prevent disentanglement despite human-understandable factors.

03

A new reconstruction loss can restore disentanglement by aligning with ground-truth factors.

Abstract

Learning disentangled representations with variational autoencoders (VAEs) is often attributed to the regularisation component of the loss. In this work, we highlight the interaction between data and the reconstruction term of the loss as the main contributor to disentanglement in VAEs. We show that standard benchmark datasets have unintended correlations between their subjective ground-truth factors and perceived axes in the data according to typical VAE reconstruction losses. Our work exploits this relationship to provide a theory for what constitutes an adversarial dataset under a given reconstruction loss. We verify this by constructing an example dataset that prevents disentanglement in state-of-the-art frameworks while maintaining human-intuitive ground-truth factors. Finally, we re-enable disentanglement by designing an example reconstruction loss that is once again able to…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

nmichlo/disent
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdversarial Robustness in Machine Learning · Digital Media Forensic Detection · Generative Adversarial Networks and Image Synthesis