AvatarGen: A 3D Generative Model for Animatable Human Avatars
Jianfeng Zhang, Zihang Jiang, Dingdong Yang, Hongyi Xu and, Yichun Shi, Guoxian Song, Zhongcong Xu, Xinchao Wang, Jiashi Feng

TL;DR
AvatarGen is a novel 3D generative model that creates high-quality, animatable human avatars from 2D images, enabling detailed control over geometry and appearance for AR/VR applications.
Contribution
It introduces a geometry-aware, disentangled 3D human synthesis method that only requires 2D images for training, with explicit pose and shape control using a 3D parametric model.
Findings
Outperforms previous 3D GANs in avatar quality and animation.
Enables applications like single-view reconstruction and text-guided editing.
Achieves high-fidelity appearance and detailed geometric modeling.
Abstract
Unsupervised generation of 3D-aware clothed humans with various appearances and controllable geometries is important for creating virtual human avatars and other AR/VR applications. Existing methods are either limited to rigid object modeling, or not generative and thus unable to generate high-quality virtual humans and animate them. In this work, we propose AvatarGen, the first method that enables not only geometry-aware clothed human synthesis with high-fidelity appearances but also disentangled human animation controllability, while only requiring 2D images for training. Specifically, we decompose the generative 3D human synthesis into pose-guided mapping and canonical representation with predefined human pose and shape, such that the canonical representation can be explicitly driven to different poses and shapes with the guidance of a 3D parametric human model SMPL. AvatarGen…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
Topics3D Shape Modeling and Analysis · Human Motion and Animation · Human Pose and Action Recognition
