PSAvatar: A Point-based Shape Model for Real-Time Head Avatar Animation with 3D Gaussian Splatting
Zhongyuan Zhao, Zhenyu Bao, Qing Li, Guoping Qiu, Kanglin, Liu

TL;DR
PSAvatar introduces a point-based shape model combined with 3D Gaussian representations to enable real-time, high-fidelity head avatar animation that captures complex geometries like hairstyles and eyeglasses.
Contribution
The paper presents a novel point-based morphable shape model integrated with 3D Gaussian for detailed, flexible, and real-time head avatar creation and animation.
Findings
Achieves real-time animation at 25 fps with high resolution.
Effectively models complex geometries like hairstyles and eyeglasses.
Provides high-fidelity head avatar reconstructions across subjects.
Abstract
Despite much progress, achieving real-time high-fidelity head avatar animation is still difficult and existing methods have to trade-off between speed and quality. 3DMM based methods often fail to model non-facial structures such as eyeglasses and hairstyles, while neural implicit models suffer from deformation inflexibility and rendering inefficiency. Although 3D Gaussian has been demonstrated to possess promising capability for geometry representation and radiance field reconstruction, applying 3D Gaussian in head avatar creation remains a major challenge since it is difficult for 3D Gaussian to model the head shape variations caused by changing poses and expressions. In this paper, we introduce PSAvatar, a novel framework for animatable head avatar creation that utilizes discrete geometric primitive to create a parametric morphable shape model and employs 3D Gaussian for fine detail…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsFace recognition and analysis · 3D Shape Modeling and Analysis · Advanced Vision and Imaging
MethodsSPEED: Separable Pyramidal Pooling EncodEr-Decoder for Real-Time Monocular Depth Estimation on Low-Resource Settings
