StyleAvatar: Real-time Photo-realistic Portrait Avatar from a Single Video
Lizhen Wang, Xiaochen Zhao, Jingxiang Sun, Yuxiang Zhang, Hongwen Zhang, Tao Yu, Yebin Liu

TL;DR
StyleAvatar is a real-time, high-fidelity portrait avatar reconstruction method that combines StyleGAN-based networks, a compositional representation, and a sliding window augmentation method to enable controllable, photo-realistic video re-animation.
Contribution
It introduces a StyleGAN-based framework that uses a compositional representation and a sliding window augmentation method for fast, high-quality, controllable portrait video generation in real time.
Findings
Generates high-quality, photo-realistic portrait videos in real time
Training converges within two hours while retaining high image fidelity
Outperforms existing facial reenactment methods in quality and controllability
Abstract
Face reenactment methods attempt to restore and re-animate portrait videos as realistically as possible. Existing methods face a dilemma in quality versus controllability: 2D GAN-based methods achieve higher image quality but suffer in fine-grained control of facial attributes compared with 3D counterparts. In this work, we propose StyleAvatar, a real-time photo-realistic portrait avatar reconstruction method using StyleGAN-based networks, which can generate high-fidelity portrait avatars with faithful expression control. We expand the capabilities of StyleGAN by introducing a compositional representation and a sliding window augmentation method, which enable faster convergence and improve translation generalization. Specifically, we divide the portrait scenes into three parts for adaptive adjustments: facial region, non-facial foreground region, and the background. Besides, our network…
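The abstract names a sliding window augmentation method but gives no details, so the following is only a minimal sketch of one plausible reading: during training, the same randomly placed window is cropped from every aligned frame (for example a conditioning image and its target), so the network sees the subject at varying offsets. The function name `sliding_window_crop`, its parameters, and the plain nested-list image format are all assumptions made for illustration, not the authors' implementation.

```python
import random


def sliding_window_crop(frames, window, rng):
    """Crop one randomly placed window from every aligned frame.

    Hypothetical helper, not taken from the paper.
    frames: list of 2D images (lists of rows) that share one height and width.
    window: (height, width) of the crop.
    rng:    a random.Random instance, passed in so results are reproducible.
    Returns (crops, (top, left)), where (top, left) is the sampled offset.
    """
    height, width = len(frames[0]), len(frames[0][0])
    win_h, win_w = window
    if win_h > height or win_w > width:
        raise ValueError("window must fit inside the frame")

    # randint is inclusive on both ends, so the window always stays in bounds.
    top = rng.randint(0, height - win_h)
    left = rng.randint(0, width - win_w)

    # Apply the identical offset to every frame so pixels stay aligned.
    crops = [
        [row[left:left + win_w] for row in frame[top:top + win_h]]
        for frame in frames
    ]
    return crops, (top, left)


if __name__ == "__main__":
    rng = random.Random(0)
    # Toy 5x6 "images": the target is the negated conditioning image.
    condition = [[r * 10 + c for c in range(6)] for r in range(5)]
    target = [[-value for value in row] for row in condition]
    (cond_crop, target_crop), offset = sliding_window_crop(
        [condition, target], (3, 4), rng
    )
    print("offset:", offset)
    print("condition crop:", cond_crop)
    print("target crop:", target_crop)
```

Sharing one offset across the input and the target keeps the pair pixel-aligned, which an image-to-image training setup would need; sampling a fresh offset for each training step is what would expose the network to translated views.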
Taxonomy
Topics: Face recognition and analysis · Generative Adversarial Networks and Image Synthesis · Advanced Image Processing Techniques
Methods: Dense Connections · R1 Regularization · Convolution · Adaptive Instance Normalization · Feedforward Network · StyleGAN
