FaceTalk: Audio-Driven Motion Diffusion for Neural Parametric Head   Models

Shivangi Aneja; Justus Thies; Angela Dai; Matthias Nie{\ss}ner

arXiv:2312.08459·cs.CV·March 19, 2024·1 cites

FaceTalk: Audio-Driven Motion Diffusion for Neural Parametric Head Models

Shivangi Aneja, Justus Thies, Angela Dai, Matthias Nie{\ss}ner

PDF

Open Access 1 Repo

TL;DR

FaceTalk introduces a novel audio-driven diffusion model for generating high-fidelity, realistic 3D head motion sequences from speech, advancing volumetric human head animation with superior naturalness.

Contribution

This work is the first to combine neural parametric head models with a latent diffusion approach for realistic, audio-driven 3D head motion synthesis, including hair and eye movements.

Findings

01

Achieves 75% improvement in perceptual user study.

02

Produces diverse, natural facial expressions.

03

Outperforms existing methods in motion realism.

Abstract

We introduce FaceTalk, a novel generative approach designed for synthesizing high-fidelity 3D motion sequences of talking human heads from input audio signal. To capture the expressive, detailed nature of human heads, including hair, ears, and finer-scale eye movements, we propose to couple speech signal with the latent space of neural parametric head models to create high-fidelity, temporally coherent motion sequences. We propose a new latent diffusion model for this task, operating in the expression space of neural parametric head models, to synthesize audio-driven realistic head sequences. In the absence of a dataset with corresponding NPHM expressions to audio, we optimize for these correspondences to produce a dataset of temporally-optimized NPHM expressions fit to audio-video recordings of people talking. To the best of our knowledge, this is the first work to propose a generative…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

shivangi-aneja/FaceTalk
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsFace recognition and analysis · Generative Adversarial Networks and Image Synthesis · Human Motion and Animation

MethodsDiffusion · Latent Diffusion Model