$\alpha$-TCVAE: On the relationship between Disentanglement and   Diversity

Cristian Meo; Louis Mahon; Anirudh Goyal; Justin Dauwels

arXiv:2411.00588·cs.LG·November 4, 2024

$\alpha$-TCVAE: On the relationship between Disentanglement and Diversity

Cristian Meo, Louis Mahon, Anirudh Goyal, Justin Dauwels

PDF

Open Access

TL;DR

This paper introduces $oldsymbol{ extalpha}$-TCVAE, a variational autoencoder that maximizes disentanglement and diversity of generated data through a novel total correlation bound, demonstrating improved representations and generative performance on complex datasets and downstream tasks.

Contribution

The paper proposes $oldsymbol{ extalpha}$-TCVAE with a new TC lower bound grounded in information theory, enhancing disentanglement and diversity in generative models, especially for complex datasets.

Findings

01

$oldsymbol{ extalpha}$-TCVAE learns more disentangled representations than baselines.

02

It generates more diverse observations without losing visual fidelity.

03

It improves downstream performance in RL tasks, notably in the Ant Maze environment.

Abstract

While disentangled representations have shown promise in generative modeling and representation learning, their downstream usefulness remains debated. Recent studies re-defined disentanglement through a formal connection to symmetries, emphasizing the ability to reduce latent domains and consequently enhance generative capabilities. However, from an information theory viewpoint, assigning a complex attribute to a specific latent variable may be infeasible, limiting the applicability of disentangled representations to simple datasets. In this work, we introduce $α$ -TCVAE, a variational autoencoder optimized using a novel total correlation (TC) lower bound that maximizes disentanglement and latent variables informativeness. The proposed TC bound is grounded in information theory constructs, generalizes the $β$ -VAE lower bound, and can be reduced to a convex combination of the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsComputability, Logic, AI Algorithms