AE-StyleGAN: Improved Training of Style-Based Auto-Encoders

Ligong Han; Sri Harsha Musunuri; Martin Renqiang Min; Ruijiang Gao; Yu; Tian; Dimitris Metaxas

arXiv:2110.08718·cs.CV·October 19, 2021

AE-StyleGAN: Improved Training of Style-Based Auto-Encoders

Ligong Han, Sri Harsha Musunuri, Martin Renqiang Min, Ruijiang Gao, Yu, Tian, Dimitris Metaxas

PDF

Open Access 1 Repo 1 Video

TL;DR

This paper introduces AE-StyleGAN, a style-based autoencoder trained end-to-end to improve image inversion and generation quality, and to enhance the disentanglement of the latent space.

Contribution

It proposes a novel end-to-end training methodology for style-based autoencoders that improves inversion and disentanglement over existing methods.

Findings

01

Outperforms baselines in image inversion quality

02

Achieves better disentanglement of latent space

03

Enhances generation quality

Abstract

StyleGANs have shown impressive results on data generation and manipulation in recent years, thanks to its disentangled style latent space. A lot of efforts have been made in inverting a pretrained generator, where an encoder is trained ad hoc after the generator is trained in a two-stage fashion. In this paper, we focus on style-based generators asking a scientific question: Does forcing such a generator to reconstruct real data lead to more disentangled latent space and make the inversion process from image to latent space easy? We describe a new methodology to train a style-based autoencoder where the encoder and generator are optimized end-to-end. We show that our proposed model consistently outperforms baselines in terms of image inversion and generation quality. Supplementary, code, and pretrained models are available on the project website.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

phymhan/stylegan2-pytorch
pytorchOfficial

Videos

AE-StyleGAN: Improved Training of Style-Based Auto-Encoders· youtube

Taxonomy

TopicsGenerative Adversarial Networks and Image Synthesis · Digital Media Forensic Detection · Image Processing and 3D Reconstruction

MethodsHigh-Order Consensuses