Improving Inversion and Generation Diversity in StyleGAN using a Gaussianized Latent Space
Jonas Wulff, Antonio Torralba

TL;DR
This paper introduces a Gaussianized latent space model for StyleGAN that improves inversion stability, enhances image diversity, and reduces artifacts by regularizing latent projections with a Gaussian prior.
Contribution
It proposes a simple nonlinear transformation to model latent space as Gaussian, leading to more stable image inversion and artifact reduction in StyleGAN.
Findings
Smoother interpolation between real and generated images
Improved stability in latent space projections
Reduced artifacts while maintaining diversity
Abstract
Modern Generative Adversarial Networks are capable of creating artificial, photorealistic images from latent vectors living in a low-dimensional learned latent space. It has been shown that a wide range of images can be projected into this space, including images outside of the domain that the generator was trained on. However, while in this case the generator reproduces the pixels and textures of the images, the reconstructed latent vectors are unstable and small perturbations result in significant image distortions. In this work, we propose to explicitly model the data distribution in latent space. We show that, under a simple nonlinear operation, the data distribution can be modeled as Gaussian and therefore expressed using sufficient statistics. This yields a simple Gaussian prior, which we use to regularize the projection of images into the latent space. The resulting projections…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGenerative Adversarial Networks and Image Synthesis · Computer Graphics and Visualization Techniques · Digital Media Forensic Detection
