Style Transfer for 2D Talking Head Animation

Trong-Thang Pham; Nhat Le; Tuong Do; Hung Nguyen; Erman Tjiputra,; Quang D. Tran; Anh Nguyen

arXiv:2303.09799·cs.CV·March 23, 2023·1 cites

Style Transfer for 2D Talking Head Animation

Trong-Thang Pham, Nhat Le, Tuong Do, Hung Nguyen, Erman Tjiputra,, Quang D. Tran, Anh Nguyen

PDF

Open Access 1 Repo

TL;DR

This paper introduces a novel method for 2D talking head animation that learns and transfers styles from reference images to generate realistic animations from a single image and audio.

Contribution

It presents a new style-aware framework capable of reconstructing personalized talking head animations with style transfer from minimal input.

Findings

01

Outperforms recent state-of-the-art methods in quality and fidelity

02

Successfully transfers styles to new static images

03

Produces photo-realistic 2D talking head animations

Abstract

Audio-driven talking head animation is a challenging research topic with many real-world applications. Recent works have focused on creating photo-realistic 2D animation, while learning different talking or singing styles remains an open problem. In this paper, we present a new method to generate talking head animation with learnable style references. Given a set of style reference frames, our framework can reconstruct 2D talking head animation based on a single input image and an audio stream. Our method first produces facial landmarks motion from the audio stream and constructs the intermediate style patterns from the style reference images. We then feed both outputs into a style-aware image generator to generate the photo-realistic and fidelity 2D animation. In practice, our framework can extract the style information of a specific character and transfer it to any new static image…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

aioz-ai/audiodrivenstyletransfer
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsFace recognition and analysis · Generative Adversarial Networks and Image Synthesis · Human Motion and Animation