Designing a 3D-Aware StyleNeRF Encoder for Face Editing
Songlin Yang, Wei Wang, Bo Peng, Jing Dong

TL;DR
This paper introduces a 3D-aware encoder based on StyleNeRF for face editing, enabling multi-view consistent face manipulation and improved temporal stability in videos.
Contribution
It proposes a novel 3D-aware encoder combining a parametric face model with StyleNeRF, enhancing 3D face editing capabilities beyond existing 2D GAN inversion methods.
Findings
Achieves 3D consistent face attribute editing and texture transfer.
Improves temporal consistency in video face editing.
Demonstrates superior performance over previous methods.
Abstract
GAN inversion has been exploited in many face manipulation tasks, but 2D GANs often fail to generate multi-view 3D consistent images. The encoders designed for 2D GANs are not able to provide sufficient 3D information for the inversion and editing. Therefore, 3D-aware GAN inversion is proposed to increase the 3D editing capability of GANs. However, the 3D-aware GAN inversion remains under-explored. To tackle this problem, we propose a 3D-aware (3Da) encoder for GAN inversion and face editing based on the powerful StyleNeRF model. Our proposed 3Da encoder combines a parametric 3D face model with a learnable detail representation model to generate geometry, texture and view direction codes. For more flexible face manipulation, we then design a dual-branch StyleFlow module to transfer the StyleNeRF codes with disentangled geometry and texture flows. Extensive experiments demonstrate that…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsFace recognition and analysis · Facial Nerve Paralysis Treatment and Research · Generative Adversarial Networks and Image Synthesis
Methodsfail
