FaceTalk: Audio-Driven Motion Diffusion for Neural Parametric Head Models
Shivangi Aneja, Justus Thies, Angela Dai, Matthias Nießner

TL;DR
FaceTalk is an audio-driven latent diffusion model that generates high-fidelity, temporally coherent 3D head motion sequences from speech by operating in the expression space of neural parametric head models (NPHMs).
Contribution
To the authors' knowledge, this is the first work to combine neural parametric head models with a latent diffusion approach for realistic, audio-driven 3D head motion synthesis, including hair, ears, and finer-scale eye movements.
Findings
Achieves a 75% improvement over existing methods in a perceptual user study.
Produces diverse, natural facial expressions.
Outperforms existing methods in motion realism.
Abstract
We introduce FaceTalk, a novel generative approach designed for synthesizing high-fidelity 3D motion sequences of talking human heads from input audio signal. To capture the expressive, detailed nature of human heads, including hair, ears, and finer-scale eye movements, we propose to couple speech signal with the latent space of neural parametric head models to create high-fidelity, temporally coherent motion sequences. We propose a new latent diffusion model for this task, operating in the expression space of neural parametric head models, to synthesize audio-driven realistic head sequences. In the absence of a dataset with corresponding NPHM expressions to audio, we optimize for these correspondences to produce a dataset of temporally-optimized NPHM expressions fit to audio-video recordings of people talking. To the best of our knowledge, this is the first work to propose a generative…
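The abstract describes a latent diffusion model that operates on sequences of NPHM expression codes. As a rough illustration only, the sketch below shows the generic DDPM-style forward noising step applied to a sequence of per-frame latent codes. The linear noise schedule, the sequence length, the code dimension, and all function names are assumptions chosen for the example and are not taken from the paper; audio conditioning and the learned denoiser are omitted entirely.

```python
import math
import random


def linear_beta_schedule(num_steps, beta_start=1e-4, beta_end=2e-2):
    """Linearly spaced noise variances (an assumed schedule, not the paper's)."""
    step = (beta_end - beta_start) / (num_steps - 1)
    return [beta_start + i * step for i in range(num_steps)]


def cumulative_alphas(betas):
    """Running product of (1 - beta_t), usually written alpha_bar_t."""
    out, prod = [], 1.0
    for beta in betas:
        prod *= 1.0 - beta
        out.append(prod)
    return out


def q_sample(x0, t, alpha_bar, rng):
    """Noise a clean latent sequence x0 to diffusion step t.

    x0 is a list of frames, each frame a list of floats (one latent code).
    Returns the noised sequence and the Gaussian noise that was added.
    """
    signal = math.sqrt(alpha_bar[t])
    sigma = math.sqrt(1.0 - alpha_bar[t])
    noise = [[rng.gauss(0.0, 1.0) for _ in frame] for frame in x0]
    x_t = [
        [signal * value + sigma * eps for value, eps in zip(frame, frame_noise)]
        for frame, frame_noise in zip(x0, noise)
    ]
    return x_t, noise


rng = random.Random(0)
num_steps = 1000
alpha_bar = cumulative_alphas(linear_beta_schedule(num_steps))

# Hypothetical sizes: 30 frames, 64-dimensional expression code per frame.
num_frames, expr_dim = 30, 64
x0 = [[rng.gauss(0.0, 1.0) for _ in range(expr_dim)] for _ in range(num_frames)]

t = 500
x_t, noise = q_sample(x0, t, alpha_bar, rng)
```

In a full model, a denoising network would be trained to predict `noise` (or `x0`) from `x_t`, the step `t`, and audio features; generation would then run the learned reverse process starting from pure noise. Those components are not shown here.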
Taxonomy
Topics: Face recognition and analysis · Generative Adversarial Networks and Image Synthesis · Human Motion and Animation
Methods: Diffusion · Latent Diffusion Model
