3D Guided Fine-Grained Face Manipulation
Zhenglin Geng, Chen Cao, Sergey Tulyakov

TL;DR
This paper introduces a 3D guided approach for fine-grained face manipulation that can generate arbitrary expressions on a given face by disentangling shape and texture, enabling realistic and diverse expression synthesis.
Contribution
The method uniquely combines 3D face modeling with disentangled shape and texture networks conditioned on expression coefficients, allowing unlimited expression generation with high realism.
Findings
Preferred in 85% of user study cases over recent methods.
Cannot reliably distinguish synthesized images from real images in ground truth comparisons.
Achieves high-quality expression manipulation validated through quantitative and qualitative evaluations.
Abstract
We present a method for fine-grained face manipulation. Given a face image with an arbitrary expression, our method can synthesize another arbitrary expression by the same person. This is achieved by first fitting a 3D face model and then disentangling the face into a texture and a shape. We then learn different networks in these two spaces. In the texture space, we use a conditional generative network to change the appearance, and carefully design input formats and loss functions to achieve the best results. In the shape space, we use a fully connected network to predict the accurate shapes and use the available depth data for supervision. Both networks are conditioned on expression coefficients rather than discrete labels, allowing us to generate an unlimited amount of expressions. We show the superiority of this disentangling approach through both quantitative and qualitative…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsFace recognition and analysis · Generative Adversarial Networks and Image Synthesis · Speech and Audio Processing
