Soft-IntroVAE: Analyzing and Improving the Introspective Variational   Autoencoder

Tal Daniel; Aviv Tamar

arXiv:2012.13253·cs.LG·March 26, 2021

Soft-IntroVAE: Analyzing and Improving the Introspective Variational Autoencoder

Tal Daniel, Aviv Tamar

PDF

2 Repos

TL;DR

Soft-IntroVAE introduces a smooth loss function to improve training stability and theoretical understanding of IntroVAE, leading to better image generation, reconstruction, and applications like image translation and out-of-distribution detection.

Contribution

It proposes the Soft-IntroVAE, a modified version of IntroVAE with a smooth exponential loss, enhancing stability and enabling comprehensive theoretical analysis.

Findings

01

Improved training stability with the exponential loss.

02

Theoretical convergence to a distribution minimizing KL and entropy.

03

Effective in image translation and out-of-distribution detection.

Abstract

The recently introduced introspective variational autoencoder (IntroVAE) exhibits outstanding image generations, and allows for amortized inference using an image encoder. The main idea in IntroVAE is to train a VAE adversarially, using the VAE encoder to discriminate between generated and real data samples. However, the original IntroVAE loss function relied on a particular hinge-loss formulation that is very hard to stabilize in practice, and its theoretical convergence analysis ignored important terms in the loss. In this work, we take a step towards better understanding of the IntroVAE model, its practical implementation, and its applications. We propose the Soft-IntroVAE, a modified IntroVAE that replaces the hinge-loss terms with a smooth exponential loss on generated samples. This change significantly improves training stability, and also enables theoretical analysis of the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

MethodsSolana Customer Service Number +1-833-534-1729 · USD Coin Customer Service Number +1-833-534-1729