Generating new pictures in complex datasets with a simple neural network

Galin Georgiev

arXiv:1810.12478·cs.CV·November 28, 2019

Generating new pictures in complex datasets with a simple neural network

Galin Georgiev

PDF

Open Access

TL;DR

This paper presents a simple variational auto-encoder with minimal latent dimensions that can generate realistic perturbations of images in complex datasets like CIFAR-10, despite not reconstructing all images perfectly.

Contribution

The authors introduce a VAE with only two latent dimensions per class, enabling effective image perturbations in complex datasets, which is a novel approach.

Findings

01

Effective image perturbation in CIFAR-10

02

Requires classifier to identify well reconstructed images

03

Limited reconstruction quality for some training images

Abstract

We introduce a version of a variational auto-encoder (VAE), which can generate good perturbations of images, when trained on a complex dataset (in our experiments, CIFAR-10). The net is using only two latent generative dimensions per class, with uni-modal probability density. The price one has to pay for good generation is that not all training images are well reconstructed. An additional classifier is required to determine which training image is well reconstructed and generally the weights of training images. Only training images which are well reconstructed, can be perturbed. For good perturbations, we use the tentative empirical drifts of well reconstructed images. The construct is not predictive in the usual statistical sense.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsGenerative Adversarial Networks and Image Synthesis · Image and Signal Denoising Methods · Model Reduction and Neural Networks