Instant Volumetric Head Avatars
Wojciech Zielonka, Timo Bolkart, Justus Thies

TL;DR
INSTA is a fast, neural radiance field-based method for creating photo-realistic digital head avatars from a single video, enabling real-time rendering and pose extrapolation with significantly reduced training time.
Contribution
The paper introduces INSTA, a novel neural radiance field approach that reconstructs avatars in under 10 minutes from monocular video, outperforming existing methods in speed and quality.
Findings
Reconstructs avatars in less than 10 minutes on modern GPUs.
Outperforms state-of-the-art methods in rendering quality.
Enables interactive rendering and pose extrapolation.
Abstract
We present Instant Volumetric Head Avatars (INSTA), a novel approach for reconstructing photo-realistic digital avatars instantaneously. INSTA models a dynamic neural radiance field based on neural graphics primitives embedded around a parametric face model. Our pipeline is trained on a single monocular RGB portrait video that observes the subject under different expressions and views. While state-of-the-art methods take up to several days to train an avatar, our method can reconstruct a digital avatar in less than 10 minutes on modern GPU hardware, which is orders of magnitude faster than previous solutions. In addition, it allows for the interactive rendering of novel poses and expressions. By leveraging the geometry prior of the underlying parametric face model, we demonstrate that INSTA extrapolates to unseen poses. In quantitative and qualitative studies on various subjects, INSTA…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsFace recognition and analysis · Generative Adversarial Networks and Image Synthesis · Advanced Vision and Imaging
