Identity-Preserving Realistic Talking Face Generation
Sanjana Sinha, Sandika Biswas, Brojeshwar Bhowmick

TL;DR
This paper presents a novel speech-driven facial animation method that simultaneously preserves identity, synchronizes audio-visual cues, and incorporates natural eye blinks, resulting in more realistic and accurate talking face videos.
Contribution
It introduces a comprehensive approach combining landmark generation, eye blink imposition, and texture synthesis to enhance realism and identity preservation in speech-driven facial animation.
Findings
Significant improvement in lip synchronization accuracy.
Enhanced image sharpness and reconstruction quality.
Higher user-rated realism compared to state-of-the-art methods.
Abstract
Speech-driven facial animation is useful for a variety of applications such as telepresence, chatbots, etc. The necessary attributes of a realistic face animation are (1) audio-visual synchronization, (2) identity preservation of the target individual, (3) plausible mouth movements, and (4) presence of natural eye blinks. Existing methods mostly address audio-visual lip synchronization, and few recent works have addressed the synthesis of natural eye blinks for overall video realism. In this paper, we propose a method for identity-preserving realistic facial animation from speech. We first generate person-independent facial landmarks from audio using DeepSpeech features for invariance to different voices, accents, etc. To add realism, we impose eye blinks on facial landmarks using unsupervised learning and retarget the person-independent landmarks to person-specific landmarks to…
Taxonomy
Methods: Batch Normalization · Dense Connections · Convolution · GAN Least Squares Loss · LSGAN
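
For readers unfamiliar with the least-squares GAN loss listed under Methods, the following is a minimal sketch of the standard LSGAN objective, not the paper's own implementation. The function names, the target labels (1 for real, 0 for fake), and the 0.5 scaling are assumptions taken from the common LSGAN formulation; the summary above does not state which labels or loss weights the authors use.

```python
# Sketch of the standard least-squares GAN (LSGAN) losses in plain Python.
# Assumption: discriminator scores are unbounded real numbers, with target
# label 1.0 for real samples and 0.0 for generated samples.

def mean(values):
    """Arithmetic mean of a non-empty list of floats."""
    return sum(values) / len(values)


def lsgan_discriminator_loss(real_scores, fake_scores,
                             real_label=1.0, fake_label=0.0):
    """Penalize squared distance of real scores from real_label
    and of fake scores from fake_label."""
    real_term = mean([(s - real_label) ** 2 for s in real_scores])
    fake_term = mean([(s - fake_label) ** 2 for s in fake_scores])
    return 0.5 * (real_term + fake_term)


def lsgan_generator_loss(fake_scores, real_label=1.0):
    """Penalize squared distance of fake scores from real_label,
    so the generator is pushed to make fakes score like real samples."""
    return 0.5 * mean([(s - real_label) ** 2 for s in fake_scores])


if __name__ == "__main__":
    # Hypothetical discriminator outputs for a small batch.
    real = [0.9, 1.1, 0.8]
    fake = [0.2, -0.1, 0.4]
    print("discriminator loss:", lsgan_discriminator_loss(real, fake))
    print("generator loss:", lsgan_generator_loss(fake))
```

Compared with the cross-entropy GAN loss, the squared error keeps penalizing generated samples that are classified correctly but lie far from the target label, which is the usual motivation given for LSGAN.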
