TL;DR
This paper introduces a novel neural radiance field-based semantic facial model that can be quickly personalized from monocular video, enabling realistic rendering and facial editing with interpretable bases.
Contribution
The paper proposes a new semantic head model using multi-level voxel fields within NeRF, allowing fast personalization and detailed facial attribute representation from monocular video.
Findings
Constructs personalized facial NeRF models in 10-20 minutes.
Achieves photo-realistic rendering in tens of milliseconds.
Demonstrates strong representation and editing capabilities.
Abstract
We present a novel semantic model for human head defined with neural radiance field. The 3D-consistent head model consist of a set of disentangled and interpretable bases, and can be driven by low-dimensional expression coefficients. Thanks to the powerful representation ability of neural radiance field, the constructed model can represent complex facial attributes including hair, wearings, which can not be represented by traditional mesh blendshape. To construct the personalized semantic facial model, we propose to define the bases as several multi-level voxel fields. With a short monocular RGB video as input, our method can construct the subject's semantic facial NeRF model with only ten to twenty minutes, and can render a photo-realistic human head image in tens of miliseconds with a given expression coefficient and view direction. With this novel representation, we apply it to many…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
