Language-Guided Face Animation by Recurrent StyleGAN-based Generator

Tiankai Hang; Huan Yang; Bei Liu; Jianlong Fu; Xin Geng; Baining Guo

arXiv:2208.05617·cs.CV·July 4, 2024·1 cites

Language-Guided Face Animation by Recurrent StyleGAN-based Generator

Tiankai Hang, Huan Yang, Bei Liu, Jianlong Fu, Xin Geng, Baining Guo

PDF

Open Access 1 Repo

TL;DR

This paper introduces a novel framework for language-guided face animation that leverages a recurrent motion generator and StyleGAN, enabling realistic video synthesis from a single image guided by natural language.

Contribution

It proposes a new method combining language semantics and motions with a recurrent generator and StyleGAN for high-quality face animation.

Findings

01

Effective in animating diverse face types including human, anime, and dog faces.

02

Produces high-quality, realistic videos from static images guided by language.

03

Outperforms existing methods in both qualitative and quantitative evaluations.

Abstract

Recent works on language-guided image manipulation have shown great power of language in providing rich semantics, especially for face images. However, the other natural information, motions, in language is less explored. In this paper, we leverage the motion information and study a novel task, language-guided face animation, that aims to animate a static face image with the help of languages. To better utilize both semantics and motions from languages, we propose a simple yet effective framework. Specifically, we propose a recurrent motion generator to extract a series of semantic and motion information from the language and feed it along with visual information to a pre-trained StyleGAN to generate high-quality frames. To optimize the proposed framework, three carefully designed loss functions are proposed including a regularization loss to keep the face identity, a path length…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

tiankaihang/language-guided-animation
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsGenerative Adversarial Networks and Image Synthesis · Face recognition and analysis · Human Motion and Animation

MethodsStyleGAN · Adaptive Instance Normalization · HuMan(Expedia)||How do I get a human at Expedia? · R1 Regularization · Convolution · Dense Connections · Feedforward Network · Path Length Regularization