Neural Face Models for Example-Based Visual Speech Synthesis
Wolfgang Paier, Anna Hilsmann, Peter Eisert

TL;DR
This paper introduces a neural face model that synthesizes high-quality 3D face geometry and textures from compact latent vectors, enabling realistic, seamless, and memory-efficient example-based facial animation for applications like sign language synthesis.
Contribution
It combines example-based animation with neural face models to reduce memory use and improve transition realism in facial motion synthesis.
Findings
Reduces memory requirements by a factor of 100
Creates seamless transitions between motion samples
Successfully synthesizes mouthings for Swiss-German sign language
Abstract
Creating realistic animations of human faces with computer graphic models is still a challenging task. It is often solved either with tedious manual work or motion capture based techniques that require specialised and costly hardware. Example based animation approaches circumvent these problems by re-using captured data of real people. This data is split into short motion samples that can be looped or concatenated in order to create novel motion sequences. The obvious advantages of this approach are the simplicity of use and the high realism, since the data exhibits only real deformations. Rather than tuning weights of a complex face rig, the animation task is performed on a higher level by arranging typical motion samples in a way such that the desired facial performance is achieved. Two difficulties with example based approaches, however, are high memory requirements as well as the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
