GaussianHeads: End-to-End Learning of Drivable Gaussian Head Avatars from Coarse-to-fine Representations
Kartik Teotia, Hyeongwoo Kim, Pablo Garrido, Marc Habermann, Mohamed, Elgharib, Christian Theobalt

TL;DR
GaussianHeads introduces an end-to-end hierarchical model for real-time, highly dynamic human head avatar rendering from multi-view images, capturing complex facial expressions and head movements with high fidelity.
Contribution
The paper presents a novel coarse-to-fine hierarchical approach that learns deformable head models and Gaussian representations for controllable, high-quality avatar synthesis from multi-view data.
Findings
Achieves high-fidelity rendering of complex facial expressions.
Enables controllable facial animation from video inputs.
Demonstrates generalization to new expressions and head poses.
Abstract
Real-time rendering of human head avatars is a cornerstone of many computer graphics applications, such as augmented reality, video games, and films, to name a few. Recent approaches address this challenge with computationally efficient geometry primitives in a carefully calibrated multi-view setup. Albeit producing photorealistic head renderings, it often fails to represent complex motion changes such as the mouth interior and strongly varying head poses. We propose a new method to generate highly dynamic and deformable human head avatars from multi-view imagery in real-time. At the core of our method is a hierarchical representation of head models that allows to capture the complex dynamics of facial expressions and head movements. First, with rich facial features extracted from raw input frames, we learn to deform the coarse facial geometry of the template mesh. We then initialize 3D…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSocial Robot Interaction and HRI · Human Pose and Action Recognition · Context-Aware Activity Recognition Systems
