TL;DR
SofGAN introduces a novel portrait image generator that decouples geometry and texture in the latent space, enabling independent control over attributes and high-quality, consistent portrait synthesis.
Contribution
The paper proposes a new GAN architecture that separates geometry and texture latent spaces, incorporating 3D geometry and semantic segmentation for improved control and realism.
Findings
Generates high-quality portraits with independent geometry and texture control.
Achieves consistent semantic segmentation across different views.
Demonstrates effectiveness in facial animation and dynamic styling applications.
Abstract
Recently, Generative Adversarial Networks (GANs)} have been widely used for portrait image generation. However, in the latent space learned by GANs, different attributes, such as pose, shape, and texture style, are generally entangled, making the explicit control of specific attributes difficult. To address this issue, we propose a SofGAN image generator to decouple the latent space of portraits into two subspaces: a geometry space and a texture space. The latent codes sampled from the two subspaces are fed to two network branches separately, one to generate the 3D geometry of portraits with canonical pose, and the other to generate textures. The aligned 3D geometries also come with semantic part segmentation, encoded as a semantic occupancy field (SOF). The SOF allows the rendering of consistent 2D semantic segmentation maps at arbitrary views, which are then fused with the generated…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
MethodsDense Connections · Adaptive Instance Normalization · R1 Regularization · Feedforward Network · HuMan(Expedia)||How do I get a human at Expedia? · Convolution · StyleGAN
