GaussianHeads: End-to-End Learning of Drivable Gaussian Head Avatars   from Coarse-to-fine Representations

Kartik Teotia; Hyeongwoo Kim; Pablo Garrido; Marc Habermann; Mohamed; Elgharib; Christian Theobalt

arXiv:2409.11951·cs.CV·September 19, 2024

GaussianHeads: End-to-End Learning of Drivable Gaussian Head Avatars from Coarse-to-fine Representations

Kartik Teotia, Hyeongwoo Kim, Pablo Garrido, Marc Habermann, Mohamed, Elgharib, Christian Theobalt

PDF

Open Access

TL;DR

GaussianHeads introduces an end-to-end hierarchical model for real-time, highly dynamic human head avatar rendering from multi-view images, capturing complex facial expressions and head movements with high fidelity.

Contribution

The paper presents a novel coarse-to-fine hierarchical approach that learns deformable head models and Gaussian representations for controllable, high-quality avatar synthesis from multi-view data.

Findings

01

Achieves high-fidelity rendering of complex facial expressions.

02

Enables controllable facial animation from video inputs.

03

Demonstrates generalization to new expressions and head poses.

Abstract

Real-time rendering of human head avatars is a cornerstone of many computer graphics applications, such as augmented reality, video games, and films, to name a few. Recent approaches address this challenge with computationally efficient geometry primitives in a carefully calibrated multi-view setup. Albeit producing photorealistic head renderings, it often fails to represent complex motion changes such as the mouth interior and strongly varying head poses. We propose a new method to generate highly dynamic and deformable human head avatars from multi-view imagery in real-time. At the core of our method is a hierarchical representation of head models that allows to capture the complex dynamics of facial expressions and head movements. First, with rich facial features extracted from raw input frames, we learn to deform the coarse facial geometry of the template mesh. We then initialize 3D…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsSocial Robot Interaction and HRI · Human Pose and Action Recognition · Context-Aware Activity Recognition Systems