Muses: Designing, Composing, Generating Nonexistent Fantasy 3D Creatures without Training

Hexiao Lu; Xiaokun Sun; Zeyu Cai; Hao Guo; Ying Tai; Jian Yang; Zhenyu Zhang

arXiv:2601.03256·cs.CV·January 7, 2026

Muses: Designing, Composing, Generating Nonexistent Fantasy 3D Creatures without Training

Hexiao Lu, Xiaokun Sun, Zeyu Cai, Hao Guo, Ying Tai, Jian Yang, Zhenyu Zhang

PDF

Open Access

TL;DR

Muses introduces a novel, training-free approach for creating diverse, realistic fantasy 3D creatures by leveraging skeletal structures and a structured pipeline, outperforming prior methods in fidelity and coherence.

Contribution

This work presents the first training-free, skeleton-guided pipeline for 3D creature generation, enabling flexible, high-quality, and out-of-domain 3D asset creation without manual intervention.

Findings

01

Achieves state-of-the-art visual fidelity and textual alignment.

02

Enables flexible 3D object editing.

03

Outperforms existing methods in realism and coherence.

Abstract

We present Muses, the first training-free method for fantastic 3D creature generation in a feed-forward paradigm. Previous methods, which rely on part-aware optimization, manual assembly, or 2D image generation, often produce unrealistic or incoherent 3D assets due to the challenges of intricate part-level manipulation and limited out-of-domain generation. In contrast, Muses leverages the 3D skeleton, a fundamental representation of biological forms, to explicitly and rationally compose diverse elements. This skeletal foundation formalizes 3D content creation as a structure-aware pipeline of design, composition, and generation. Muses begins by constructing a creatively composed 3D skeleton with coherent layout and scale through graph-constrained reasoning. This skeleton then guides a voxel-based assembly process within a structured latent space, integrating regions from different…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

Topics3D Shape Modeling and Analysis · Generative Adversarial Networks and Image Synthesis · Human Motion and Animation