Enhancing variational generation through self-decomposition

Andrea Asperti; Laura Bugo; Daniele Filippini

arXiv:2202.02738·cs.CV·July 15, 2022

TL;DR

This paper introduces Split Variational Autoencoders (SVAE), which decompose generated images into meaningful components using learned maps, improving generation quality and interpretability without extra loss functions.

Contribution

The paper proposes a novel SVAE model that automatically decomposes images into components via learned maps, enhancing generative performance without additional training constraints.

Findings

01 SVAE outperforms previous variational models on MNIST, CIFAR-10, and CelebA.

02 Decomposition schemes can be syntactic or semantic, affecting image quality.

03 The method improves FID scores by encouraging meaningful image splits.

Abstract

In this article we introduce the notion of Split Variational Autoencoder (SVAE), whose output $\hat{x}$ is obtained as a weighted sum $\sigma \odot \hat{x}_1 + (1 - \sigma) \odot \hat{x}_2$ of two generated images $\hat{x}_1, \hat{x}_2$, where $\sigma$ is a learned compositional map. The composing images $\hat{x}_1, \hat{x}_2$, as well as the $\sigma$-map, are automatically synthesized by the model. The network is trained as a usual Variational Autoencoder with a negative log-likelihood loss between training and reconstructed images. No additional loss is required for $\hat{x}_1, \hat{x}_2$ or $\sigma$, nor any form of human tuning. The decomposition is nondeterministic, but follows two main schemes, which we may roughly categorize as either "syntactic" or "semantic". In the first case, the map tends to exploit the strong correlation between adjacent pixels, splitting the image…
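The weighted sum described in the abstract can be illustrated with a minimal NumPy sketch. This is not the authors' implementation: the function name `compose`, the 32×32 RGB shapes, the random arrays standing in for decoder outputs, and the choice of a single-channel map broadcast across color channels are all assumptions made for illustration only.

```python
import numpy as np


def compose(x1_hat, x2_hat, sigma):
    """Blend two generated images with a per-pixel map sigma in [0, 1].

    Computes sigma * x1_hat + (1 - sigma) * x2_hat elementwise.
    """
    return sigma * x1_hat + (1.0 - sigma) * x2_hat


# Random stand-ins for the two generated images and the map (hypothetical data;
# in an SVAE all three would be produced by the decoder).
rng = np.random.default_rng(0)
x1_hat = rng.random((32, 32, 3))
x2_hat = rng.random((32, 32, 3))
# Assumption: a single-channel map shared across RGB via broadcasting.
sigma = rng.random((32, 32, 1))

x_hat = compose(x1_hat, x2_hat, sigma)
```

Where `sigma` is 1 the output copies `x1_hat`, where it is 0 the output copies `x2_hat`, and intermediate values mix the two pixel by pixel, so every output value lies between the corresponding values of the two component images.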

Code & Models

Repositories

asperti/split-vae (TensorFlow, official)
