Audio2Rig: Artist-oriented deep learning tool for facial animation

Bastien Arcelin; Nicolas Chaverou

arXiv:2405.20412·cs.GR·June 3, 2024

Audio2Rig: Artist-oriented deep learning tool for facial animation

Bastien Arcelin, Nicolas Chaverou

PDF

TL;DR

Audio2Rig is a deep learning tool that automates facial and lip sync animation from audio, enabling stylized, high-quality results that fit seamlessly into animator workflows while ensuring data privacy.

Contribution

It introduces a show-specific deep learning method that generates rig animation directly from audio, adaptable to different facial parts and styles without rig adjustments.

Findings

01

Produces high-quality, stylized facial animations

02

Allows customization of emotions and intensities

03

Ensures data privacy by training on studio data

Abstract

Creating realistic or stylized facial and lip sync animation is a tedious task. It requires lot of time and skills to sync the lips with audio and convey the right emotion to the character's face. To allow animators to spend more time on the artistic and creative part of the animation, we present Audio2Rig: a new deep learning based tool leveraging previously animated sequences of a show, to generate facial and lip sync rig animation from an audio file. Based in Maya, it learns from any production rig without any adjustment and generates high quality and stylized animations which mimic the style of the show. Audio2Rig fits in the animator workflow: since it generates keys on the rig controllers, the animation can be easily retaken. The method is based on 3 neural network modules which can learn an arbitrary number of controllers. Hence, different configurations can be created for…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.