Trained Latent Space Navigation to Prevent Lack of Photorealism in   Generated Images on Style-based Models

Takumi Harada; Kazuyuki Aihara; Hiroyuki Sakai

arXiv:2310.00936·cs.CV·October 3, 2023

Trained Latent Space Navigation to Prevent Lack of Photorealism in Generated Images on Style-based Models

Takumi Harada, Kazuyuki Aihara, Hiroyuki Sakai

PDF

Open Access

TL;DR

This paper introduces an unsupervised method for navigating the latent space of StyleGAN models to maintain photorealism during image manipulation, addressing a common challenge in generative image quality.

Contribution

The proposed method identifies local latent subspaces to enable realistic image editing while preserving photorealism, improving upon traditional latent code manipulation techniques.

Findings

01

Images within local latent subspaces retain photorealism after manipulation.

02

Method effectively applies to various style-based models.

03

Enhances latent code optimization for high-quality image generation.

Abstract

Recent studies on StyleGAN variants show promising performances for various generation tasks. In these models, latent codes have traditionally been manipulated and searched for the desired images. However, this approach sometimes suffers from a lack of photorealism in generated images due to a lack of knowledge about the geometry of the trained latent space. In this paper, we show a simple unsupervised method that provides well-trained local latent subspace, enabling latent code navigation while preserving the photorealism of the generated images. Specifically, the method identifies densely mapped latent spaces and restricts latent manipulations within the local latent subspace. Experimental results demonstrate that images generated within the local latent subspace maintain photorealism even when the latent codes are significantly and repeatedly manipulated. Moreover, experiments show…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsGenerative Adversarial Networks and Image Synthesis · Advanced Image and Video Retrieval Techniques · Multimodal Machine Learning Applications

MethodsDense Connections · Feedforward Network · Convolution · HuMan(Expedia)||How do I get a human at Expedia? · R1 Regularization · Adaptive Instance Normalization · StyleGAN