Single Image, Any Face: Generalisable 3D Face Generation

Wenqing Wang; Haosen Yang; Josef Kittler; Xiatian Zhu

arXiv:2409.16990·cs.CV·March 10, 2026

Single Image, Any Face: Generalisable 3D Face Generation

Wenqing Wang, Haosen Yang, Josef Kittler, Xiatian Zhu

PDF

Open Access

TL;DR

This paper introduces Gen3D-Face, a novel diffusion-based model that generates photorealistic 3D human faces from a single unconstrained image, achieving high generalization and multi-view consistency without requiring ground-truth 3D data.

Contribution

The paper presents the first unified framework for single-image 3D face generation that works across domains, using a multi-view diffusion approach and subject-specific mesh estimation.

Findings

01

Outperforms previous methods in out-of-domain scenarios

02

Achieves top results in in-domain competitions

03

Demonstrates high multi-view consistency and realism

Abstract

The creation of 3D human face avatars from a single unconstrained image is a fundamental task that underlies numerous real-world vision and graphics applications. Despite the significant progress made in generative models, existing methods are either less suited in design for human faces or fail to generalise from the restrictive training domain to unconstrained facial images. To address these limitations, we propose a novel model, Gen3D-Face, which generates 3D human faces with unconstrained single image input within a multi-view consistent diffusion framework. Given a specific input image, our model first produces multi-view images, followed by neural surface construction. To incorporate face geometry information while preserving generalisation to in-the-wild inputs, we estimate a subject-specific mesh directly from the input image, enabling training and evaluation without…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsFace recognition and analysis

MethodsDiffusion