DreamHuman: Animatable 3D Avatars from Text
Nikos Kolotouros, Thiemo Alldieck, Andrei Zanfir, Eduard Gabriel, Bazavan, Mihai Fieraru, Cristian Sminchisescu

TL;DR
DreamHuman is a novel method that generates high-quality, animatable 3D human avatars from textual descriptions, integrating text-to-image models, neural radiance fields, and statistical body models for realistic and diverse results.
Contribution
It introduces a new framework combining large text-to-image models, neural radiance fields, and human body models to produce animated 3D avatars from text descriptions.
Findings
Generates diverse, realistic 3D human avatars from text
Outperforms existing text-to-3D methods in visual fidelity
Produces high-quality textures and realistic animations
Abstract
We present DreamHuman, a method to generate realistic animatable 3D human avatar models solely from textual descriptions. Recent text-to-3D methods have made considerable strides in generation, but are still lacking in important aspects. Control and often spatial resolution remain limited, existing methods produce fixed rather than animated 3D human models, and anthropometric consistency for complex structures like people remains a challenge. DreamHuman connects large text-to-image synthesis models, neural radiance fields, and statistical human body models in a novel modeling and optimization framework. This makes it possible to generate dynamic 3D human avatars with high-quality textures and learned, instance-specific, surface deformations. We demonstrate that our method is capable to generate a wide variety of animatable, realistic 3D human models from text. Our 3D models have diverse…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Taxonomy
TopicsHuman Pose and Action Recognition · Human Motion and Animation · 3D Shape Modeling and Analysis
