Towards Photographic Image Manipulation with Balanced Growing of   Generative Autoencoders

Ari Heljakka; Arno Solin; Juho Kannala

arXiv:1904.06145·cs.LG·February 21, 2020

Towards Photographic Image Manipulation with Balanced Growing of Generative Autoencoders

Ari Heljakka, Arno Solin, Juho Kannala

PDF

1 Repo

TL;DR

This paper introduces an improved generative autoencoder that achieves fast encoding, high-quality high-resolution images, and a well-structured latent space for semantic manipulation, advancing face image editing technology.

Contribution

It significantly enhances the PIONEER autoencoder model by altering training dynamics, leading to better image quality, identity preservation, and disentangled latent representations.

Findings

01

Improved face identity conservation on CelebAHQ.

02

State-of-the-art latent space disentanglement.

03

Enhanced performance on LSUN Bedrooms dataset.

Abstract

We present a generative autoencoder that provides fast encoding, faithful reconstructions (eg. retaining the identity of a face), sharp generated/reconstructed samples in high resolutions, and a well-structured latent space that supports semantic manipulation of the inputs. There are no current autoencoder or GAN models that satisfactorily achieve all of these. We build on the progressively growing autoencoder model PIONEER, for which we completely alter the training dynamics based on a careful analysis of recently introduced normalization schemes. We show significantly improved visual and quantitative results for face identity conservation in CelebAHQ. Our model achieves state-of-the-art disentanglement of latent space, both quantitatively and via realistic image attribute manipulations. On the LSUN Bedrooms dataset, we improve the disentanglement performance of the vanilla PIONEER,…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

AaltoVision/balanced-pioneer
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

MethodsSolana Customer Service Number +1-833-534-1729 · Convolution · Dogecoin Customer Service Number +1-833-534-1729