Meta-Learning Empowered Meta-Face: Personalized Speaking Style   Adaptation for Audio-Driven 3D Talking Face Animation

Xukun Zhou; Fengxin Li; Ziqiao Peng; Kejian Wu; Jun He; Biao Qin,; Zhaoxin Fan; Hongyan Liu

arXiv:2408.09357·cs.GR·August 20, 2024

Meta-Learning Empowered Meta-Face: Personalized Speaking Style Adaptation for Audio-Driven 3D Talking Face Animation

Xukun Zhou, Fengxin Li, Ziqiao Peng, Kejian Wu, Jun He, Biao Qin,, Zhaoxin Fan, Hongyan Liu

PDF

Open Access

TL;DR

MetaFace is a meta-learning based approach that enables personalized, adaptable 3D talking face animation across diverse speaking styles, outperforming existing methods in efficiency and accuracy.

Contribution

The paper introduces MetaFace, a novel meta-learning framework with innovative components for effective and efficient speaking style adaptation in 3D face animation.

Findings

01

MetaFace outperforms existing baselines in speaking style adaptation.

02

MetaFace establishes a new state-of-the-art in personalized 3D face animation.

03

The approach improves efficiency in model optimization and style learning.

Abstract

Audio-driven 3D face animation is increasingly vital in live streaming and augmented reality applications. While remarkable progress has been observed, most existing approaches are designed for specific individuals with predefined speaking styles, thus neglecting the adaptability to varied speaking styles. To address this limitation, this paper introduces MetaFace, a novel methodology meticulously crafted for speaking style adaptation. Grounded in the novel concept of meta-learning, MetaFace is composed of several key components: the Robust Meta Initialization Stage (RMIS) for fundamental speaking style adaptation, the Dynamic Relation Mining Neural Process (DRMN) for forging connections between observed and unobserved speaking styles, and the Low-rank Matrix Memory Reduction Approach to enhance the efficiency of model optimization as well as learning style details. Leveraging these…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsFace recognition and analysis · Human Motion and Animation · Generative Adversarial Networks and Image Synthesis