AE-StyleGAN: Improved Training of Style-Based Auto-Encoders
Ligong Han, Sri Harsha Musunuri, Martin Renqiang Min, Ruijiang Gao, Yu, Tian, Dimitris Metaxas

TL;DR
This paper introduces AE-StyleGAN, a style-based autoencoder trained end-to-end to improve image inversion and generation quality, and to enhance the disentanglement of the latent space.
Contribution
It proposes a novel end-to-end training methodology for style-based autoencoders that improves inversion and disentanglement over existing methods.
Findings
Outperforms baselines in image inversion quality
Achieves better disentanglement of latent space
Enhances generation quality
Abstract
StyleGANs have shown impressive results on data generation and manipulation in recent years, thanks to its disentangled style latent space. A lot of efforts have been made in inverting a pretrained generator, where an encoder is trained ad hoc after the generator is trained in a two-stage fashion. In this paper, we focus on style-based generators asking a scientific question: Does forcing such a generator to reconstruct real data lead to more disentangled latent space and make the inversion process from image to latent space easy? We describe a new methodology to train a style-based autoencoder where the encoder and generator are optimized end-to-end. We show that our proposed model consistently outperforms baselines in terms of image inversion and generation quality. Supplementary, code, and pretrained models are available on the project website.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
AE-StyleGAN: Improved Training of Style-Based Auto-Encoders· youtube
Taxonomy
TopicsGenerative Adversarial Networks and Image Synthesis · Digital Media Forensic Detection · Image Processing and 3D Reconstruction
MethodsHigh-Order Consensuses
