C3DAG: Controlled 3D Animal Generation using 3D pose guidance
Sandeep Mishra, Oindrila Saha, Alan C. Bovik

TL;DR
C3DAG introduces a novel framework for generating high-quality, anatomically accurate 3D animals from text prompts with precise pose control, enhancing realism and customization over previous methods.
Contribution
The paper presents a new pose-controlled text-to-3D animal generation method and an automatic shape creation tool, improving anatomical accuracy and pose flexibility.
Findings
Produces geometrically and anatomically consistent 3D animals
Allows fine-grained pose control in 3D animal generation
Generates high-quality, customizable 3D assets
Abstract
Recent advancements in text-to-3D generation have demonstrated the ability to generate high quality 3D assets. However while generating animals these methods underperform, often portraying inaccurate anatomy and geometry. Towards ameliorating this defect, we present C3DAG, a novel pose-Controlled text-to-3D Animal Generation framework which generates a high quality 3D animal consistent with a given pose. We also introduce an automatic 3D shape creator tool, that allows dynamic pose generation and modification via a web-based tool, and that generates a 3D balloon animal using simple geometries. A NeRF is then initialized using this 3D shape using depth-controlled SDS. In the next stage, the pre-trained NeRF is fine-tuned using quadruped-pose-controlled SDS. The pipeline that we have developed not only produces geometrically and anatomically consistent results, but also renders highly…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsRobotics and Sensor-Based Localization · Human Motion and Animation · Advanced Image and Video Retrieval Techniques
