DF-3DFace: One-to-Many Speech Synchronized 3D Face Animation with   Diffusion

Se Jin Park; Joanna Hong; Minsu Kim; Yong Man Ro

arXiv:2310.05934·cs.CV·October 11, 2023·2 cites

DF-3DFace: One-to-Many Speech Synchronized 3D Face Animation with Diffusion

Se Jin Park, Joanna Hong, Minsu Kim, Yong Man Ro

PDF

Open Access

TL;DR

DF-3DFace introduces a diffusion-based approach for speech-driven 3D facial animation that captures one-to-many relationships, enabling realistic and varied facial expressions, identities, and poses without needing reference meshes.

Contribution

The paper presents a novel diffusion-driven method for speech-to-3D face synthesis that models complex variability in facial attributes and introduces a large-scale dataset for training.

Findings

01

Successfully generates diverse facial shapes and motions from speech.

02

Achieves more realistic facial animations than existing methods.

03

Effectively models identity, pose, and facial motion jointly.

Abstract

Speech-driven 3D facial animation has gained significant attention for its ability to create realistic and expressive facial animations in 3D space based on speech. Learning-based methods have shown promising progress in achieving accurate facial motion synchronized with speech. However, one-to-many nature of speech-to-3D facial synthesis has not been fully explored: while the lip accurately synchronizes with the speech content, other facial attributes beyond speech-related motions are variable with respect to the speech. To account for the potential variance in the facial attributes within a single speech, we propose DF-3DFace, a diffusion-driven speech-to-3D face mesh synthesis. DF-3DFace captures the complex one-to-many relationships between speech and 3D face based on diffusion. It concurrently achieves aligned lip motion by exploiting audio-mesh synchronization and masked…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsFace recognition and analysis · Facial Nerve Paralysis Treatment and Research