StyleAvatar: Real-time Photo-realistic Portrait Avatar from a Single Video

Lizhen Wang; Xiaochen Zhao; Jingxiang Sun; Yuxiang Zhang; Hongwen Zhang; Tao Yu; Yebin Liu

arXiv:2305.00942·cs.CV·May 2, 2023·1 cites

TL;DR

StyleAvatar is a real-time, high-fidelity portrait avatar reconstruction method that combines StyleGAN-based networks, a compositional representation, and sliding window augmentation to enable controllable, photo-realistic video re-animation.

Contribution

It introduces a StyleGAN-based framework that adds a compositional representation and a sliding window augmentation method for fast, high-quality, controllable portrait video generation in real time.

Findings

01 Achieves high-quality, photo-realistic portrait videos in real time

02 Converges within two hours with high image fidelity

03 Outperforms existing facial reenactment methods in quality and controllability

Abstract

Face reenactment methods attempt to restore and re-animate portrait videos as realistically as possible. Existing methods face a dilemma in quality versus controllability: 2D GAN-based methods achieve higher image quality but suffer in fine-grained control of facial attributes compared with 3D counterparts. In this work, we propose StyleAvatar, a real-time photo-realistic portrait avatar reconstruction method using StyleGAN-based networks, which can generate high-fidelity portrait avatars with faithful expression control. We expand the capabilities of StyleGAN by introducing a compositional representation and a sliding window augmentation method, which enable faster convergence and improve translation generalization. Specifically, we divide the portrait scenes into three parts for adaptive adjustments: facial region, non-facial foreground region, and the background. Besides, our network…
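
The abstract names a sliding window augmentation but does not describe it here. As a rough illustration only, the sketch below assumes one plausible reading: the same randomly placed crop window is cut from an input image and its ground-truth frame, so the network sees translated copies of each training pair. The function name `sliding_window_pair`, the square window, and the list-of-rows image layout are all hypothetical and are not taken from the paper or its repository.

```python
import random


def sliding_window_pair(source, target, window, rng):
    """Crop one randomly placed square window from two aligned images.

    Images are lists of rows (hypothetical layout). The same offset is
    used for both images so the pair stays pixel-aligned after cropping.
    """
    height, width = len(source), len(source[0])
    if len(target) != height or len(target[0]) != width:
        raise ValueError("source and target must have the same size")
    if window > height or window > width:
        raise ValueError("window is larger than the image")

    # Pick the top-left corner of the window; randint is inclusive.
    top = rng.randint(0, height - window)
    left = rng.randint(0, width - window)

    def crop(image):
        return [row[left:left + window] for row in image[top:top + window]]

    return crop(source), crop(target), (top, left)


if __name__ == "__main__":
    rng = random.Random(0)
    # Toy 6x8 "images" whose pixels record their own coordinates.
    src = [[(r, c) for c in range(8)] for r in range(6)]
    tgt = [[(r, c, "gt") for c in range(8)] for r in range(6)]
    src_crop, tgt_crop, offset = sliding_window_pair(src, tgt, 4, rng)
    print("offset:", offset, "crop size:", len(src_crop), "x", len(src_crop[0]))
```

Sharing one offset between the two crops is the key design choice in this sketch: shifting only one image would break the pixel correspondence that an image-to-image training loss relies on.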

Code & Models

Repositories

lizhenwangt/styleavatar (PyTorch, official)

Taxonomy

Topics: Face recognition and analysis · Generative Adversarial Networks and Image Synthesis · Advanced Image Processing Techniques

Methods: Dense Connections · R1 Regularization · Convolution · Adaptive Instance Normalization · Feedforward Network · StyleGAN