TL;DR
This paper introduces a novel dynamic neural radiance field approach for reconstructing and rendering 4D facial avatars from monocular videos, enabling realistic and controllable face modeling for AR/VR applications.
Contribution
It combines implicit neural scene representations with a morphable model to handle facial dynamics, learned solely from monocular data, surpassing existing methods in realism.
Findings
Achieves photo-realistic facial image generation.
Outperforms state-of-the-art video reenactment methods.
Operates without specialized capture setups.
Abstract
We present dynamic neural radiance fields for modeling the appearance and dynamics of a human face. Digitally modeling and reconstructing a talking human is a key building-block for a variety of applications. Especially, for telepresence applications in AR or VR, a faithful reproduction of the appearance including novel viewpoints or head-poses is required. In contrast to state-of-the-art approaches that model the geometry and material properties explicitly, or are purely image-based, we introduce an implicit representation of the head based on scene representation networks. To handle the dynamics of the face, we combine our scene representation network with a low-dimensional morphable model which provides explicit control over pose and expressions. We use volumetric rendering to generate images from this hybrid representation and demonstrate that such a dynamic neural scene…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
