Uniform Transformation: Refining Latent Representation in Variational   Autoencoders

Ye Shi; C.S. George Lee

arXiv:2407.02681·cs.LG·July 4, 2024

Uniform Transformation: Refining Latent Representation in Variational Autoencoders

Ye Shi, C.S. George Lee

PDF

Open Access 1 Repo

TL;DR

This paper introduces a three-stage Uniform Transformation module for VAEs that reconfigures irregular latent distributions into uniform ones, significantly improving disentanglement and interpretability of latent representations.

Contribution

The paper proposes a novel adaptable three-stage UT module combining G-KDE clustering, GM modeling, and PIT to address irregular latent distributions in VAEs, enhancing their performance.

Findings

01

Improved disentanglement metrics on dSprites and MNIST datasets.

02

Effective reconfiguration of latent space into uniform distribution.

03

Enhanced interpretability of latent representations.

Abstract

Irregular distribution in latent space causes posterior collapse, misalignment between posterior and prior, and ill-sampling problem in Variational Autoencoders (VAEs). In this paper, we introduce a novel adaptable three-stage Uniform Transformation (UT) module -- Gaussian Kernel Density Estimation (G-KDE) clustering, non-parametric Gaussian Mixture (GM) Modeling, and Probability Integral Transform (PIT) -- to address irregular latent distributions. By reconfiguring irregular distributions into a uniform distribution in the latent space, our approach significantly enhances the disentanglement and interpretability of latent representations, overcoming the limitation of traditional VAE models in capturing complex data structures. Empirical evaluations demonstrated the efficacy of our proposed UT module in improving disentanglement metrics across benchmark datasets -- dSprites and MNIST.…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

supershiye/UniformTransformationModule
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsGenerative Adversarial Networks and Image Synthesis · Natural Language Processing Techniques · Computational and Text Analysis Methods