Instant 3D Human Avatar Generation using Image Diffusion Models
Nikos Kolotouros, Thiemo Alldieck, Enric Corona, Eduard Gabriel, Bazavan, Cristian Sminchisescu

TL;DR
AvatarPopUp is a fast, high-quality 3D human avatar generation method leveraging diffusion models and 3D lifting, enabling diverse, multimodal control with unprecedented speed.
Contribution
The paper introduces a decoupled diffusion-based approach for rapid, high-quality 3D human avatar creation from images and text, with minimal fine-tuning and broad control capabilities.
Findings
Produces 3D avatars in as little as 2 seconds.
Achieves high-quality, diverse avatars respecting multimodal inputs.
Outperforms existing methods in speed by four orders of magnitude.
Abstract
We present AvatarPopUp, a method for fast, high quality 3D human avatar generation from different input modalities, such as images and text prompts and with control over the generated pose and shape. The common theme is the use of diffusion-based image generation networks that are specialized for each particular task, followed by a 3D lifting network. We purposefully decouple the generation from the 3D modeling which allow us to leverage powerful image synthesis priors, trained on billions of text-image pairs. We fine-tune latent diffusion networks with additional image conditioning for image generation and back-view prediction, and to support qualitatively different multiple 3D hypotheses. Our partial fine-tuning approach allows to adapt the networks for each task without inducing catastrophic forgetting. In our experiments, we demonstrate that our method produces accurate,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsHuman Motion and Animation · 3D Shape Modeling and Analysis · Human Pose and Action Recognition
MethodsDiffusion
