TalkingGaussian: Structure-Persistent 3D Talking Head Synthesis via Gaussian Splatting
Jiahe Li, Jiawei Zhang, Xiao Bai, Jin Zheng, Xin Ning, Jun Zhou, Lin, Gu

TL;DR
TalkingGaussian introduces a deformation-based radiance fields framework using Gaussian Splatting for high-fidelity, lip-synchronized 3D talking head synthesis that preserves facial features and improves efficiency.
Contribution
It proposes a novel deformation paradigm with Gaussian primitives for stable facial motion synthesis and addresses face-mouth motion inconsistency with a dual-branch model.
Findings
Produces high-quality lip-synchronized videos
Achieves better facial fidelity
Offers higher efficiency
Abstract
Radiance fields have demonstrated impressive performance in synthesizing lifelike 3D talking heads. However, due to the difficulty in fitting steep appearance changes, the prevailing paradigm that presents facial motions by directly modifying point appearance may lead to distortions in dynamic regions. To tackle this challenge, we introduce TalkingGaussian, a deformation-based radiance fields framework for high-fidelity talking head synthesis. Leveraging the point-based Gaussian Splatting, facial motions can be represented in our method by applying smooth and continuous deformations to persistent Gaussian primitives, without requiring to learn the difficult appearance change like previous methods. Due to this simplification, precise facial motions can be synthesized while keeping a highly intact facial feature. Under such a deformation paradigm, we further identify a face-mouth motion…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsFace recognition and analysis · 3D Shape Modeling and Analysis · Image Processing and 3D Reconstruction
