DF-3DFace: One-to-Many Speech Synchronized 3D Face Animation with Diffusion
Se Jin Park, Joanna Hong, Minsu Kim, Yong Man Ro

TL;DR
DF-3DFace introduces a diffusion-based approach for speech-driven 3D facial animation that captures one-to-many relationships, enabling realistic and varied facial expressions, identities, and poses without needing reference meshes.
Contribution
The paper presents a novel diffusion-driven method for speech-to-3D face synthesis that models complex variability in facial attributes and introduces a large-scale dataset for training.
Findings
Successfully generates diverse facial shapes and motions from speech.
Achieves more realistic facial animations than existing methods.
Effectively models identity, pose, and facial motion jointly.
Abstract
Speech-driven 3D facial animation has gained significant attention for its ability to create realistic and expressive facial animations in 3D space based on speech. Learning-based methods have shown promising progress in achieving accurate facial motion synchronized with speech. However, one-to-many nature of speech-to-3D facial synthesis has not been fully explored: while the lip accurately synchronizes with the speech content, other facial attributes beyond speech-related motions are variable with respect to the speech. To account for the potential variance in the facial attributes within a single speech, we propose DF-3DFace, a diffusion-driven speech-to-3D face mesh synthesis. DF-3DFace captures the complex one-to-many relationships between speech and 3D face based on diffusion. It concurrently achieves aligned lip motion by exploiting audio-mesh synchronization and masked…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsFace recognition and analysis · Facial Nerve Paralysis Treatment and Research
