C3DAG: Controlled 3D Animal Generation using 3D pose guidance

Sandeep Mishra; Oindrila Saha; Alan C. Bovik

arXiv:2406.07742·cs.CV·June 13, 2024

C3DAG: Controlled 3D Animal Generation using 3D pose guidance

Sandeep Mishra, Oindrila Saha, Alan C. Bovik

PDF

Open Access

TL;DR

C3DAG introduces a novel framework for generating high-quality, anatomically accurate 3D animals from text prompts with precise pose control, enhancing realism and customization over previous methods.

Contribution

The paper presents a new pose-controlled text-to-3D animal generation method and an automatic shape creation tool, improving anatomical accuracy and pose flexibility.

Findings

01

Produces geometrically and anatomically consistent 3D animals

02

Allows fine-grained pose control in 3D animal generation

03

Generates high-quality, customizable 3D assets

Abstract

Recent advancements in text-to-3D generation have demonstrated the ability to generate high quality 3D assets. However while generating animals these methods underperform, often portraying inaccurate anatomy and geometry. Towards ameliorating this defect, we present C3DAG, a novel pose-Controlled text-to-3D Animal Generation framework which generates a high quality 3D animal consistent with a given pose. We also introduce an automatic 3D shape creator tool, that allows dynamic pose generation and modification via a web-based tool, and that generates a 3D balloon animal using simple geometries. A NeRF is then initialized using this 3D shape using depth-controlled SDS. In the next stage, the pre-trained NeRF is fine-tuned using quadruped-pose-controlled SDS. The pipeline that we have developed not only produces geometrically and anatomically consistent results, but also renders highly…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsRobotics and Sensor-Based Localization · Human Motion and Animation · Advanced Image and Video Retrieval Techniques