Revealing Directions for Text-guided 3D Face Editing
Zhuo Chen, Yichao Yan, Sehngqi Liu, Yuhao Cheng, Weiming Zhao,, Lincheng Li, Mengxiao Bi, Xiaokang Yang

TL;DR
Face Clan introduces a fast, text-guided 3D face editing method that leverages diffusion models within 3D-aware GANs for precise, controllable, and intuitive face manipulation based on arbitrary attribute descriptions.
Contribution
The paper proposes integrating diffusion models into 3D-aware GANs to enable disentangled, controllable, and efficient text-guided 3D face editing with improved generalization.
Findings
Effective and generalizable face editing across pre-trained GANs.
Precise control over editing regions using diffusion-based masking.
Supports arbitrary text descriptions for intuitive face manipulation.
Abstract
3D face editing is a significant task in multimedia, aimed at the manipulation of 3D face models across various control signals. The success of 3D-aware GAN provides expressive 3D models learned from 2D single-view images only, encouraging researchers to discover semantic editing directions in its latent space. However, previous methods face challenges in balancing quality, efficiency, and generalization. To solve the problem, we explore the possibility of introducing the strength of diffusion model into 3D-aware GANs. In this paper, we present Face Clan, a fast and text-general approach for generating and manipulating 3D faces based on arbitrary attribute descriptions. To achieve disentangled editing, we propose to diffuse on the latent space under a pair of opposite prompts to estimate the mask indicating the region of interest on latent codes. Based on the mask, we then apply…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsFace recognition and analysis
MethodsDiffusion
