Single Image, Any Face: Generalisable 3D Face Generation
Wenqing Wang, Haosen Yang, Josef Kittler, Xiatian Zhu

TL;DR
This paper introduces Gen3D-Face, a novel diffusion-based model that generates photorealistic 3D human faces from a single unconstrained image, achieving high generalization and multi-view consistency without requiring ground-truth 3D data.
Contribution
The paper presents the first unified framework for single-image 3D face generation that works across domains, using a multi-view diffusion approach and subject-specific mesh estimation.
Findings
Outperforms previous methods in out-of-domain scenarios
Achieves top results in in-domain competitions
Demonstrates high multi-view consistency and realism
Abstract
The creation of 3D human face avatars from a single unconstrained image is a fundamental task that underlies numerous real-world vision and graphics applications. Despite the significant progress made in generative models, existing methods are either less suited in design for human faces or fail to generalise from the restrictive training domain to unconstrained facial images. To address these limitations, we propose a novel model, Gen3D-Face, which generates 3D human faces with unconstrained single image input within a multi-view consistent diffusion framework. Given a specific input image, our model first produces multi-view images, followed by neural surface construction. To incorporate face geometry information while preserving generalisation to in-the-wild inputs, we estimate a subject-specific mesh directly from the input image, enabling training and evaluation without…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsFace recognition and analysis
MethodsDiffusion
