DreamDissector: Learning Disentangled Text-to-3D Generation from 2D Diffusion Priors
Zizheng Yan, Jiapeng Zhou, Fanpeng Meng, Yushuang Wu, Lingteng Qiu,, Zisheng Ye, Shuguang Cui, Guanying Chen, Xiaoguang Han

TL;DR
DreamDissector introduces a novel method for generating multiple independent and interacting objects in 3D scenes from text prompts, advancing the realism and controllability of text-to-3D generation.
Contribution
It proposes Neural Category Field and Category Score Distillation Sampling to disentangle and generate multi-object 3D scenes, addressing limitations of existing methods.
Findings
Successfully generates multi-object 3D scenes with interactions
Enables object-level control in 3D synthesis
Outperforms existing methods in realism and diversity
Abstract
Text-to-3D generation has recently seen significant progress. To enhance its practicality in real-world applications, it is crucial to generate multiple independent objects with interactions, similar to layer-compositing in 2D image editing. However, existing text-to-3D methods struggle with this task, as they are designed to generate either non-independent objects or independent objects lacking spatially plausible interactions. Addressing this, we propose DreamDissector, a text-to-3D method capable of generating multiple independent objects with interactions. DreamDissector accepts a multi-object text-to-3D NeRF as input and produces independent textured meshes. To achieve this, we introduce the Neural Category Field (NeCF) for disentangling the input NeRF. Additionally, we present the Category Score Distillation Sampling (CSDS), facilitated by a Deep Concept Mining (DCM) module, to…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
MethodsDiffusion
