YouDream: Generating Anatomically Controllable Consistent Text-to-3D Animals
Sandeep Mishra, Oindrila Saha, Alan C. Bovik

TL;DR
YouDream is a novel method for generating high-quality, anatomically consistent 3D animals guided by text and 2D views, surpassing previous methods in controllability and realism.
Contribution
We introduce YouDream, a new approach that combines text-to-image diffusion models with 3D pose control to generate anatomically accurate 3D animals without manual intervention.
Findings
Generated animals are more anatomically consistent.
User preference favors YouDream over previous methods.
The pipeline automates 3D animal creation effectively.
Abstract
3D generation guided by text-to-image diffusion models enables the creation of visually compelling assets. However previous methods explore generation based on image or text. The boundaries of creativity are limited by what can be expressed through words or the images that can be sourced. We present YouDream, a method to generate high-quality anatomically controllable animals. YouDream is guided using a text-to-image diffusion model controlled by 2D views of a 3D pose prior. Our method generates 3D animals that are not possible to create using previous text-to-3D generative methods. Additionally, our method is capable of preserving anatomic consistency in the generated animals, an area where prior text-to-3D approaches often struggle. Moreover, we design a fully automated pipeline for generating commonly found animals. To circumvent the need for human intervention to create a 3D pose,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Taxonomy
TopicsImage Processing and 3D Reconstruction · Human Motion and Animation
MethodsLib · Diffusion
