How to train your VAE

Mariano Rivera

arXiv:2309.13160·cs.LG·June 25, 2024

How to train your VAE

Mariano Rivera

PDF

Open Access 1 Repo

TL;DR

This paper investigates the role of KL Divergence in VAEs, proposing a new ELBO formulation with a Gaussian mixture posterior, regularization, and adversarial training to improve face generation quality.

Contribution

It introduces a novel ELBO redefinition using Gaussian mixtures, adds regularization to prevent variance collapse, and employs adversarial training for enhanced realism in VAE outputs.

Findings

01

Generated realistic face images

02

Improved texture realism with PatchGAN discriminator

03

Enhanced VAE training stability

Abstract

Variational Autoencoders (VAEs) have become a cornerstone in generative modeling and representation learning within machine learning. This paper explores a nuanced aspect of VAEs, focusing on interpreting the Kullback-Leibler (KL) Divergence, a critical component within the Evidence Lower Bound (ELBO) that governs the trade-off between reconstruction accuracy and regularization. Meanwhile, the KL Divergence enforces alignment between latent variable distributions and a prior imposing a structure on the overall latent space but leaves individual variable distributions unconstrained. The proposed method redefines the ELBO with a mixture of Gaussians for the posterior probability, introduces a regularization term to prevent variance collapse, and employs a PatchGAN discriminator to enhance texture realism. Implementation details involve ResNetV2 architectures for both the Encoder and…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

marianorivera/How2TrainUrVAE
tfOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAnomaly Detection Techniques and Applications

MethodsPatchGAN