Preface: A Data-driven Volumetric Prior for Few-shot Ultra   High-resolution Face Synthesis

Marcel C. B\"uhler (1; 2); Kripasindhu Sarkar (2); Tanmay Shah (2),; Gengyan Li (1; 2); Daoye Wang (2); Leonhard Helminger (2); Sergio; Orts-Escolano (2); Dmitry Lagun (2); Otmar Hilliges (1); Thabo Beeler (2),; Abhimitra Meka (2) ((1) ETH Zurich; (2) Google)

arXiv:2309.16859·cs.CV·October 2, 2023

Preface: A Data-driven Volumetric Prior for Few-shot Ultra High-resolution Face Synthesis

Marcel C. B\"uhler (1, 2), Kripasindhu Sarkar (2), Tanmay Shah (2),, Gengyan Li (1, 2), Daoye Wang (2), Leonhard Helminger (2), Sergio, Orts-Escolano (2), Dmitry Lagun (2), Otmar Hilliges (1), Thabo Beeler (2),, Abhimitra Meka (2) ((1) ETH Zurich, (2) Google)

PDF

1 Video

TL;DR

This paper introduces a data-driven volumetric prior for ultra high-resolution face synthesis that enables realistic, multi-view face generation from minimal input views, reducing hardware needs and broadening applicability.

Contribution

It presents a novel identity-conditioned NeRF model trained on low-res multi-view images, allowing high-quality face synthesis from just two casual images.

Findings

01

High-quality face synthesis from 2-3 views.

02

Model generalizes to unseen identities.

03

Requires minimal multi-view input at inference.

Abstract

NeRFs have enabled highly realistic synthesis of human faces including complex appearance and reflectance effects of hair and skin. These methods typically require a large number of multi-view input images, making the process hardware intensive and cumbersome, limiting applicability to unconstrained settings. We propose a novel volumetric human face prior that enables the synthesis of ultra high-resolution novel views of subjects that are not part of the prior's training distribution. This prior model consists of an identity-conditioned NeRF, trained on a dataset of low-resolution multi-view images of diverse humans with known camera calibration. A simple sparse landmark-based 3D alignment of the training dataset allows our model to learn a smooth latent space of geometry and appearance despite a limited number of training identities. A high-quality volumetric representation of a novel…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

Preface: A Data-driven Volumetric Prior for Few-shot Ultra High-resolution Face Synthesis· youtube