Injecting 3D Perception of Controllable NeRF-GAN into StyleGAN for Editable Portrait Image Synthesis
Jeong-gi Kwak, Yuanming Li, Dongsik Yoon, Donghyeon Kim, David Han,, Hanseok Ko

TL;DR
This paper introduces SURF-GAN, a 3D-aware GAN that discovers and controls semantic attributes unsupervised, and integrates it into StyleGAN to enable explicit 3D pose control in high-fidelity portrait synthesis, addressing multi-view inconsistency and editing limitations.
Contribution
The paper proposes SURF-GAN, a novel 3D-aware GAN that learns semantic attributes unsupervised and injects this knowledge into StyleGAN for explicit 3D pose control in portrait generation.
Findings
SURF-GAN can discover semantic attributes during training.
Injected SURF-GAN prior enables explicit pose control in StyleGAN.
The method improves multi-view consistency and editing capabilities.
Abstract
Over the years, 2D GANs have achieved great successes in photorealistic portrait generation. However, they lack 3D understanding in the generation process, thus they suffer from multi-view inconsistency problem. To alleviate the issue, many 3D-aware GANs have been proposed and shown notable results, but 3D GANs struggle with editing semantic attributes. The controllability and interpretability of 3D GANs have not been much explored. In this work, we propose two solutions to overcome these weaknesses of 2D GANs and 3D-aware GANs. We first introduce a novel 3D-aware GAN, SURF-GAN, which is capable of discovering semantic attributes during training and controlling them in an unsupervised manner. After that, we inject the prior of SURF-GAN into StyleGAN to obtain a high-fidelity 3D-controllable generator. Unlike existing latent-based methods allowing implicit pose control, the proposed…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGenerative Adversarial Networks and Image Synthesis · 3D Shape Modeling and Analysis · 3D Surveying and Cultural Heritage
MethodsStyleGAN · Dense Connections · Convolution · HuMan(Expedia)||How do I get a human at Expedia? · R1 Regularization · Feedforward Network · Adaptive Instance Normalization
