ScaffoldAvatar: High-Fidelity Gaussian Avatars with Patch Expressions
Shivangi Aneja, Sebastian Weiss, Irene Baeza, Prashanth Chandran, Gaspard Zoss, Matthias Nie{\ss}ner, Derek Bradley

TL;DR
This paper introduces ScaffoldAvatar, a novel method for creating high-fidelity, expressive 3D head avatars using patch-based local expressions and 3D Gaussian splatting, enabling real-time, photorealistic animations.
Contribution
It proposes a patch-based local expression modeling approach combined with 3D Gaussian splatting for ultra-high fidelity avatar synthesis, surpassing global expression methods.
Findings
Achieves state-of-the-art visual quality and realism.
Supports diverse facial expressions and styles in real time.
Faster convergence with high-resolution training images.
Abstract
Generating high-fidelity real-time animated sequences of photorealistic 3D head avatars is important for many graphics applications, including immersive telepresence and movies. This is a challenging problem particularly when rendering digital avatar close-ups for showing character's facial microfeatures and expressions. To capture the expressive, detailed nature of human heads, including skin furrowing and finer-scale facial movements, we propose to couple locally-defined facial expressions with 3D Gaussian splatting to enable creating ultra-high fidelity, expressive and photorealistic 3D head avatars. In contrast to previous works that operate on a global expression space, we condition our avatar's dynamics on patch-based local expression features and synthesize 3D Gaussians at a patch level. In particular, we leverage a patch-based geometric 3D face model to extract patch expressions…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
