PSAvatar: A Point-based Shape Model for Real-Time Head Avatar Animation   with 3D Gaussian Splatting

Zhongyuan Zhao; Zhenyu Bao; Qing Li; Guoping Qiu; Kanglin; Liu

arXiv:2401.12900·cs.GR·June 25, 2024·2 cites

PSAvatar: A Point-based Shape Model for Real-Time Head Avatar Animation with 3D Gaussian Splatting

Zhongyuan Zhao, Zhenyu Bao, Qing Li, Guoping Qiu, Kanglin, Liu

PDF

Open Access 1 Repo

TL;DR

PSAvatar introduces a point-based shape model combined with 3D Gaussian representations to enable real-time, high-fidelity head avatar animation that captures complex geometries like hairstyles and eyeglasses.

Contribution

The paper presents a novel point-based morphable shape model integrated with 3D Gaussian for detailed, flexible, and real-time head avatar creation and animation.

Findings

01

Achieves real-time animation at 25 fps with high resolution.

02

Effectively models complex geometries like hairstyles and eyeglasses.

03

Provides high-fidelity head avatar reconstructions across subjects.

Abstract

Despite much progress, achieving real-time high-fidelity head avatar animation is still difficult and existing methods have to trade-off between speed and quality. 3DMM based methods often fail to model non-facial structures such as eyeglasses and hairstyles, while neural implicit models suffer from deformation inflexibility and rendering inefficiency. Although 3D Gaussian has been demonstrated to possess promising capability for geometry representation and radiance field reconstruction, applying 3D Gaussian in head avatar creation remains a major challenge since it is difficult for 3D Gaussian to model the head shape variations caused by changing poses and expressions. In this paper, we introduce PSAvatar, a novel framework for animatable head avatar creation that utilizes discrete geometric primitive to create a parametric morphable shape model and employs 3D Gaussian for fine detail…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

pcl3dv/PSAvatar
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsFace recognition and analysis · 3D Shape Modeling and Analysis · Advanced Vision and Imaging

MethodsSPEED: Separable Pyramidal Pooling EncodEr-Decoder for Real-Time Monocular Depth Estimation on Low-Resource Settings