DreamPartGen: Semantically Grounded Part-Level 3D Generation via Collaborative Latent Denoising

Tianjiao Yu; Xinzhuo Li; Muntasir Wahed; Jerry Xiong; Yifan Shen; Ying Shen; Ismini Lourentzou

arXiv:2603.19216·cs.CV·March 20, 2026

DreamPartGen: Semantically Grounded Part-Level 3D Generation via Collaborative Latent Denoising

Tianjiao Yu, Xinzhuo Li, Muntasir Wahed, Jerry Xiong, Yifan Shen, Ying Shen, Ismini Lourentzou

PDF

Open Access

TL;DR

DreamPartGen is a novel framework that generates semantically grounded, part-aware 3D objects from text by jointly modeling geometry, appearance, and inter-part relations, achieving state-of-the-art results.

Contribution

It introduces Duplex Part Latents and Relational Semantic Latents for joint geometric and semantic modeling in text-to-3D generation.

Findings

01

Achieves state-of-the-art geometric fidelity.

02

Demonstrates superior text-shape alignment.

03

Produces coherent and interpretable 3D models.

Abstract

Understanding and generating 3D objects as compositions of meaningful parts is fundamental to human perception and reasoning. However, most text-to-3D methods overlook the semantic and functional structure of parts. While recent part-aware approaches introduce decomposition, they remain largely geometry-focused, lacking semantic grounding and failing to model how parts align with textual descriptions or their inter-part relations. We propose DreamPartGen, a framework for semantically grounded, part-aware text-to-3D generation. DreamPartGen introduces Duplex Part Latents (DPLs) that jointly model each part's geometry and appearance, and Relational Semantic Latents (RSLs) that capture inter-part dependencies derived from language. A synchronized co-denoising process enforces mutual geometric and semantic consistency, enabling coherent, interpretable, and text-aligned 3D synthesis. Across…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

Topics3D Shape Modeling and Analysis · Human Motion and Animation · Generative Adversarial Networks and Image Synthesis