Self-Supervised 3D Face Reconstruction via Conditional Estimation

Yandong Wen; Weiyang Liu; Bhiksha Raj; Rita Singh

arXiv:2110.04800·cs.CV·October 12, 2021

Self-Supervised 3D Face Reconstruction via Conditional Estimation

Yandong Wen, Weiyang Liu, Bhiksha Raj, Rita Singh

PDF

Open Access

TL;DR

This paper introduces CEST, a self-supervised framework for 3D face reconstruction from 2D images using analysis-by-synthesis and statistical dependency modeling, trained on videos without explicit labels.

Contribution

The novel CEST framework estimates 3D facial parameters conditioned on previous estimates, leveraging symmetry and consistency for self-supervised learning from videos.

Findings

01

Effective 3D face reconstruction demonstrated on in-the-wild videos.

02

Improved disentanglement of facial parameters through symmetry and consistency constraints.

03

Qualitative and quantitative results validate the approach.

Abstract

We present a conditional estimation (CEST) framework to learn 3D facial parameters from 2D single-view images by self-supervised training from videos. CEST is based on the process of analysis by synthesis, where the 3D facial parameters (shape, reflectance, viewpoint, and illumination) are estimated from the face image, and then recombined to reconstruct the 2D face image. In order to learn semantically meaningful 3D facial parameters without explicit access to their labels, CEST couples the estimation of different 3D facial parameters by taking their statistical dependency into account. Specifically, the estimation of any 3D facial parameter is not only conditioned on the given image, but also on the facial parameters that have already been derived. Moreover, the reflectance symmetry and consistency among the video frames are adopted to improve the disentanglement of facial parameters.…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsFace recognition and analysis · Video Surveillance and Tracking Methods · Face and Expression Recognition