ToonifyGB: StyleGAN-based Gaussian Blendshapes for 3D Stylized Head Avatars
Rui-Yang Ju, Sheng-Yen Huang, Yi-Ping Hung

TL;DR
ToonifyGB is a two-stage framework that extends StyleGAN-based stylization to 3D Gaussian blendshapes, enabling the creation of diverse, high-quality stylized 3D head avatars with arbitrary expressions from monocular videos.
Contribution
It introduces an improved StyleGAN for stable stylized video generation and a method to learn stylized 3D Gaussian blendshapes for animatable avatars.
Findings
Effective stylized video generation with improved StyleGAN.
Successful synthesis of stylized 3D head avatars with arbitrary expressions.
Validated on benchmark datasets with styles like Arcane and Pixar.
Abstract
The introduction of 3D Gaussian blendshapes has enabled the real-time reconstruction of animatable head avatars from monocular video. Toonify, a StyleGAN-based method, has become widely used for facial image stylization. To extend Toonify for synthesizing diverse stylized 3D head avatars using Gaussian blendshapes, we propose an efficient two-stage framework, ToonifyGB. In Stage 1 (stylized video generation), we adopt an improved StyleGAN to generate the stylized video from the input video frames, which overcomes the limitation of cropping aligned faces at a fixed resolution as preprocessing for normal StyleGAN. This process provides a more stable stylized video, which enables Gaussian blendshapes to better capture the high-frequency details of the video frames, facilitating the synthesis of high-quality animations in the next stage. In Stage 2 (Gaussian blendshapes synthesis), our…
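The abstract is cut off before it describes Stage 2, so the following is only a minimal sketch of the general linear blendshape idea that Gaussian blendshapes build on, not the authors' implementation. It assumes each Gaussian attribute (here, a 3D position) is stored as a neutral value plus one per-expression offset, and that an expression is produced by a weighted sum of those offsets. The function name, argument names, and the toy values are all hypothetical.

```python
from typing import List, Sequence

Vector = Sequence[float]


def blend_gaussian_attribute(
    neutral: Sequence[Vector],
    deltas: Sequence[Sequence[Vector]],
    weights: Sequence[float],
) -> List[List[float]]:
    """Linear blendshape mix (assumed model): neutral + sum_k w_k * delta_k.

    neutral: one attribute vector per Gaussian (e.g. xyz position).
    deltas:  for each expression basis k, one offset vector per Gaussian.
    weights: one expression coefficient per basis.
    """
    if len(deltas) != len(weights):
        raise ValueError("need exactly one weight per blendshape basis")

    # Start from a copy of the neutral model so the input is not mutated.
    blended = [list(row) for row in neutral]
    for delta, w in zip(deltas, weights):
        if len(delta) != len(blended):
            raise ValueError("each basis must cover every Gaussian")
        for i, offset in enumerate(delta):
            for j, d in enumerate(offset):
                blended[i][j] += w * d
    return blended


# Toy data (hypothetical): two Gaussians, two expression bases.
neutral_positions = [[0.0, 0.0, 0.0], [1.0, 0.0, 0.0]]
jaw_open = [[0.0, -0.2, 0.0], [0.0, -0.4, 0.0]]
smile = [[0.1, 0.0, 0.0], [0.1, 0.1, 0.0]]

posed = blend_gaussian_attribute(
    neutral_positions, [jaw_open, smile], weights=[0.5, 1.0]
)
```

With all weights set to zero the function returns the neutral model unchanged, which is the property that lets a tracked expression vector drive the avatar frame by frame. In a full system the same mixing would apply to the other Gaussian attributes as well; how ToonifyGB handles them is not stated in the text available here.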
Taxonomy
Topics: 3D Shape Modeling and Analysis · Human Motion and Animation · Face recognition and analysis
Methods: Convolution · Dense Connections · R1 Regularization · Sparse Evolutionary Training · Adaptive Instance Normalization · Feedforward Network · StyleGAN
