HeadStudio: Text to Animatable Head Avatars with 3D Gaussian Splatting

Zhenglin Zhou; Fan Ma; Hehe Fan; Zongxin Yang; Yi Yang

arXiv:2402.06149·cs.CV·December 24, 2024·1 cites

HeadStudio: Text to Animatable Head Avatars with 3D Gaussian Splatting

Zhenglin Zhou, Fan Ma, Hehe Fan, Zongxin Yang, Yi Yang

PDF

Open Access 1 Repo

TL;DR

HeadStudio introduces a novel 3D Gaussian splatting framework that creates realistic, animatable head avatars from text prompts, enabling high-quality, real-time rendering and smooth animation driven by speech or video.

Contribution

It presents a new method combining 3D Gaussian splatting with semantic animation for high-quality, text-driven, animatable head avatars, improving upon prior static and less detailed models.

Findings

01

Generates realistic avatars from text prompts.

02

Achieves real-time rendering at 40 fps with high resolution.

03

Supports smooth animation driven by speech and video.

Abstract

Creating digital avatars from textual prompts has long been a desirable yet challenging task. Despite the promising results achieved with 2D diffusion priors, current methods struggle to create high-quality and consistent animated avatars efficiently. Previous animatable head models like FLAME have difficulty in accurately representing detailed texture and geometry. Additionally, high-quality 3D static representations face challenges in semantically driving with dynamic priors. In this paper, we introduce \textbf{HeadStudio}, a novel framework that utilizes 3D Gaussian splatting to generate realistic and animatable avatars from text prompts. Firstly, we associate 3D Gaussians with animatable head prior model, facilitating semantic animation on high-quality 3D representations. To ensure consistent animation, we further enhance the optimization from initialization, distillation, and…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

zhenglinzhou/headstudio
jaxOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAugmented Reality Applications · Human Motion and Animation · Interactive and Immersive Displays

MethodsDiffusion