TL;DR
Soft-IntroVAE introduces a smooth loss function to improve training stability and theoretical understanding of IntroVAE, leading to better image generation, reconstruction, and applications like image translation and out-of-distribution detection.
Contribution
It proposes the Soft-IntroVAE, a modified version of IntroVAE with a smooth exponential loss, enhancing stability and enabling comprehensive theoretical analysis.
Findings
Improved training stability with the exponential loss.
Theoretical convergence to a distribution minimizing KL and entropy.
Effective in image translation and out-of-distribution detection.
Abstract
The recently introduced introspective variational autoencoder (IntroVAE) exhibits outstanding image generations, and allows for amortized inference using an image encoder. The main idea in IntroVAE is to train a VAE adversarially, using the VAE encoder to discriminate between generated and real data samples. However, the original IntroVAE loss function relied on a particular hinge-loss formulation that is very hard to stabilize in practice, and its theoretical convergence analysis ignored important terms in the loss. In this work, we take a step towards better understanding of the IntroVAE model, its practical implementation, and its applications. We propose the Soft-IntroVAE, a modified IntroVAE that replaces the hinge-loss terms with a smooth exponential loss on generated samples. This change significantly improves training stability, and also enables theoretical analysis of the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
MethodsSolana Customer Service Number +1-833-534-1729 · USD Coin Customer Service Number +1-833-534-1729
