Improving Inversion and Generation Diversity in StyleGAN using a   Gaussianized Latent Space

Jonas Wulff; Antonio Torralba

arXiv:2009.06529·cs.CV·September 15, 2020·26 cites

Improving Inversion and Generation Diversity in StyleGAN using a Gaussianized Latent Space

Jonas Wulff, Antonio Torralba

PDF

Open Access

TL;DR

This paper introduces a Gaussianized latent space model for StyleGAN that improves inversion stability, enhances image diversity, and reduces artifacts by regularizing latent projections with a Gaussian prior.

Contribution

It proposes a simple nonlinear transformation to model latent space as Gaussian, leading to more stable image inversion and artifact reduction in StyleGAN.

Findings

01

Smoother interpolation between real and generated images

02

Improved stability in latent space projections

03

Reduced artifacts while maintaining diversity

Abstract

Modern Generative Adversarial Networks are capable of creating artificial, photorealistic images from latent vectors living in a low-dimensional learned latent space. It has been shown that a wide range of images can be projected into this space, including images outside of the domain that the generator was trained on. However, while in this case the generator reproduces the pixels and textures of the images, the reconstructed latent vectors are unstable and small perturbations result in significant image distortions. In this work, we propose to explicitly model the data distribution in latent space. We show that, under a simple nonlinear operation, the data distribution can be modeled as Gaussian and therefore expressed using sufficient statistics. This yields a simple Gaussian prior, which we use to regularize the projection of images into the latent space. The resulting projections…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsGenerative Adversarial Networks and Image Synthesis · Computer Graphics and Visualization Techniques · Digital Media Forensic Detection