Gaussian Eigen Models for Human Heads

Wojciech Zielonka, Timo Bolkart, Thabo Beeler, and Justus Thies

arXiv:2407.04545 · cs.CV · April 1, 2025

TL;DR

This paper introduces Gaussian Eigen Models (GEM), a lightweight, high-quality head avatar representation that combines 3D Gaussian primitives with eigenbasis decomposition, enabling efficient and realistic facial animation from a single image.

Contribution

The paper proposes GEM, a novel eigenbasis-based head avatar model that distills high-quality CNN-generated avatars into a lightweight, controllable, and easily animatable representation.

Findings

1. GEM achieves higher visual quality than state-of-the-art methods.

2. GEM generalizes better to new facial expressions.

3. GEM enables efficient head avatar generation from a single image.

Abstract

Current personalized neural head avatars face a trade-off: lightweight models lack detail and realism, while high-quality, animatable avatars require significant computational resources, making them unsuitable for commodity devices. To address this gap, we introduce Gaussian Eigen Models (GEM), which provide high-quality, lightweight, and easily controllable head avatars. GEM utilizes 3D Gaussian primitives for representing the appearance combined with Gaussian splatting for rendering. Building on the success of mesh-based 3D morphable face models (3DMM), we define GEM as an ensemble of linear eigenbases for representing the head appearance of a specific subject. In particular, we construct linear bases to represent the position, scale, rotation, and opacity of the 3D Gaussians. This allows us to efficiently generate Gaussian primitives of a specific head shape by a linear combination…
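The abstract is truncated here, but the core idea it describes (per-attribute linear eigenbases whose linear combination yields a set of Gaussian primitives) can be illustrated with a minimal NumPy sketch. This is not the authors' implementation: the function names, the PCA-via-SVD construction, the array sizes, and the synthetic training data are all assumptions made for illustration.

```python
import numpy as np


def build_eigenbasis(samples, num_components):
    """Fit a linear basis with PCA (via SVD). samples: (num_frames, dim)."""
    mean = samples.mean(axis=0)
    centered = samples - mean
    # Rows of vt are orthonormal principal directions, sorted by variance.
    _, _, vt = np.linalg.svd(centered, full_matrices=False)
    return mean, vt[:num_components]


def project(sample, mean, basis):
    """Coefficients of one flattened sample in the basis."""
    return basis @ (sample - mean)


def reconstruct(coeffs, mean, basis):
    """Linear combination of basis vectors added to the mean."""
    return mean + coeffs @ basis


# Hypothetical sizes: 50 training frames, 100 Gaussians, 8 basis vectors.
rng = np.random.default_rng(0)
num_frames, num_gaussians, k = 50, 100, 8

# Per-Gaussian attribute widths: position (3), scale (3),
# rotation as a quaternion (4), opacity (1).
attribute_dims = {"position": 3, "scale": 3, "rotation": 4, "opacity": 1}

# Synthetic stand-in for per-frame Gaussian attributes. Each attribute is
# generated from k latent factors so that k basis vectors capture it exactly.
training = {}
for name, width in attribute_dims.items():
    latent = rng.normal(size=(num_frames, k))
    mixing = rng.normal(size=(k, num_gaussians * width))
    offset = rng.normal(size=(1, num_gaussians * width))
    training[name] = offset + latent @ mixing

# The "ensemble of eigenbases": one (mean, basis) pair per attribute.
model = {name: build_eigenbasis(data, k) for name, data in training.items()}

# Express one frame as coefficients, then regenerate its Gaussians.
frame = 0
coeffs = {
    name: project(training[name][frame], *model[name]) for name in model
}
gaussians = {
    name: reconstruct(coeffs[name], *model[name]).reshape(num_gaussians, -1)
    for name in model
}

position_error = np.abs(
    gaussians["position"].reshape(-1) - training["position"][frame]
).max()
print("max position error:", position_error)
```

In this sketch each frame is driven by only `k` coefficients per attribute, which is what makes such a representation cheap to evaluate. A practical pipeline would probably also need to renormalize the reconstructed quaternions and constrain scale and opacity to valid ranges; those steps are assumptions of this note, not details taken from the abstract.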

Taxonomy

Topics: Morphological variations and asymmetry

Methods: Linear Layer · SPEED: Separable Pyramidal Pooling EncodEr-Decoder for Real-Time Monocular Depth Estimation on Low-Resource Settings