Language-Guided Face Animation by Recurrent StyleGAN-based Generator
Tiankai Hang, Huan Yang, Bei Liu, Jianlong Fu, Xin Geng, Baining Guo

TL;DR
This paper introduces a novel framework for language-guided face animation that leverages a recurrent motion generator and StyleGAN, enabling realistic video synthesis from a single image guided by natural language.
Contribution
It proposes a new method combining language semantics and motions with a recurrent generator and StyleGAN for high-quality face animation.
Findings
Effective in animating diverse face types including human, anime, and dog faces.
Produces high-quality, realistic videos from static images guided by language.
Outperforms existing methods in both qualitative and quantitative evaluations.
Abstract
Recent works on language-guided image manipulation have shown great power of language in providing rich semantics, especially for face images. However, the other natural information, motions, in language is less explored. In this paper, we leverage the motion information and study a novel task, language-guided face animation, that aims to animate a static face image with the help of languages. To better utilize both semantics and motions from languages, we propose a simple yet effective framework. Specifically, we propose a recurrent motion generator to extract a series of semantic and motion information from the language and feed it along with visual information to a pre-trained StyleGAN to generate high-quality frames. To optimize the proposed framework, three carefully designed loss functions are proposed including a regularization loss to keep the face identity, a path length…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGenerative Adversarial Networks and Image Synthesis · Face recognition and analysis · Human Motion and Animation
MethodsStyleGAN · Adaptive Instance Normalization · HuMan(Expedia)||How do I get a human at Expedia? · R1 Regularization · Convolution · Dense Connections · Feedforward Network · Path Length Regularization
