Semantic Unfolding of StyleGAN Latent Space

Mustafa Shukor; Xu Yao; Bharath Bushan Damodaran; Pierre Hellier

arXiv:2206.14892·cs.CV·July 1, 2022

Semantic Unfolding of StyleGAN Latent Space

Mustafa Shukor, Xu Yao, Bharath Bushan Damodaran, Pierre Hellier

PDF

Open Access

TL;DR

This paper introduces a supervised method using normalizing flows to improve the semantic disentanglement of GAN latent spaces, enhancing face image editing capabilities.

Contribution

It proposes a novel approach to better disentangle facial attributes in GANs by learning a proxy latent space with normalizing flows, addressing limitations of linear attribute separation.

Findings

01

Enhanced facial attribute disentanglement

02

More efficient face image editing

03

Supervised learning improves latent space structure

Abstract

Generative adversarial networks (GANs) have proven to be surprisingly efficient for image editing by inverting and manipulating the latent code corresponding to an input real image. This editing property emerges from the disentangled nature of the latent space. In this paper, we identify that the facial attribute disentanglement is not optimal, thus facial editing relying on linear attribute separation is flawed. We thus propose to improve semantic disentanglement with supervision. Our method consists in learning a proxy latent representation using normalizing flows, and we show that this leads to a more efficient space for face image editing.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsGenerative Adversarial Networks and Image Synthesis · Face recognition and analysis · Digital Media Forensic Detection