TL;DR
Audio2Face-3D is a real-time, open-source system that generates realistic facial animations for digital avatars driven by audio input, enhancing interactive experiences and avatar creation.
Contribution
This paper introduces NVIDIA Audio2Face-3D, a comprehensive system with open-source tools for audio-driven facial animation of digital avatars, including data, architecture, and evaluation methods.
Findings
Enables real-time facial animation for avatars
Provides open-source SDK and training framework
Facilitates realistic avatar interaction
Abstract
Audio-driven facial animation presents an effective solution for animating digital avatars. In this paper, we detail the technical aspects of NVIDIA Audio2Face-3D, including data acquisition, network architecture, retargeting methodology, evaluation metrics, and use cases. Audio2Face-3D system enables real-time interaction between human users and interactive avatars, facilitating facial animation authoring for game characters. To assist digital avatar creators and game developers in generating realistic facial animations, we have open-sourced Audio2Face-3D networks, SDK, training framework, and example dataset.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
- 🤗nvidia/Audio2Emotion-v3.0model· 34 dl· ♡ 3334 dl♡ 33
- 🤗nvidia/Audio2Emotion-v2.2model· 55 dl· ♡ 2255 dl♡ 22
- 🤗nvidia/Audio2Face-3D-v3.0model· 183 dl· ♡ 68183 dl♡ 68
- 🤗nvidia/Audio2Face-3D-v2.3-Markmodel· 84 dl· ♡ 1484 dl♡ 14
- 🤗nvidia/Audio2Face-3D-v2.3.1-Clairemodel· 78 dl· ♡ 978 dl♡ 9
- 🤗nvidia/Audio2Face-3D-v2.3.1-Jamesmodel· 64 dl· ♡ 664 dl♡ 6
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
