YouDream: Generating Anatomically Controllable Consistent Text-to-3D   Animals

Sandeep Mishra; Oindrila Saha; Alan C. Bovik

arXiv:2406.16273·cs.CV·June 25, 2024

YouDream: Generating Anatomically Controllable Consistent Text-to-3D Animals

Sandeep Mishra, Oindrila Saha, Alan C. Bovik

PDF

Open Access 1 Video

TL;DR

YouDream is a novel method for generating high-quality, anatomically consistent 3D animals guided by text and 2D views, surpassing previous methods in controllability and realism.

Contribution

We introduce YouDream, a new approach that combines text-to-image diffusion models with 3D pose control to generate anatomically accurate 3D animals without manual intervention.

Findings

01

Generated animals are more anatomically consistent.

02

User preference favors YouDream over previous methods.

03

The pipeline automates 3D animal creation effectively.

Abstract

3D generation guided by text-to-image diffusion models enables the creation of visually compelling assets. However previous methods explore generation based on image or text. The boundaries of creativity are limited by what can be expressed through words or the images that can be sourced. We present YouDream, a method to generate high-quality anatomically controllable animals. YouDream is guided using a text-to-image diffusion model controlled by 2D views of a 3D pose prior. Our method generates 3D animals that are not possible to create using previous text-to-3D generative methods. Additionally, our method is capable of preserving anatomic consistency in the generated animals, an area where prior text-to-3D approaches often struggle. Moreover, we design a fully automated pipeline for generating commonly found animals. To circumvent the need for human intervention to create a 3D pose,…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

YouDream: Generating Anatomically Controllable Consistent Text-to-3D Animals· slideslive

Taxonomy

TopicsImage Processing and 3D Reconstruction · Human Motion and Animation

MethodsLib · Diffusion