LAYOUTDREAMER: Physics-guided Layout for Text-to-3D Compositional Scene Generation

Yang Zhou; Zongjin He; Qixuan Li; Chao Wang

arXiv:2502.01949·cs.CV·March 11, 2026

LAYOUTDREAMER: Physics-guided Layout for Text-to-3D Compositional Scene Generation

Yang Zhou, Zongjin He, Qixuan Li, Chao Wang

PDF

TL;DR

LayoutDreamer introduces a physics-guided, controllable framework for text-to-3D scene generation that improves realism, compositionality, and semantic alignment using 3D Gaussian Splatting and scene graph-based adjustments.

Contribution

The paper presents LayoutDreamer, a novel framework that enhances text-guided 3D scene generation by integrating physical constraints and scene graph-based layout adjustments.

Findings

01

Outperforms existing methods in scene quality and semantic alignment.

02

Achieves state-of-the-art results on T3Bench multiple objects metric.

03

Demonstrates high physical plausibility and controllability in generated scenes.

Abstract

Recently, the field of text-guided 3D scene generation has garnered significant attention. High-quality generation that aligns with physical realism and high controllability is crucial for practical 3D scene applications. However, existing methods face fundamental limitations: (i) difficulty capturing complex relationships between multiple objects described in the text, (ii) inability to generate physically plausible scene layouts, and (iii) lack of controllability and extensibility in compositional scenes. In this paper, we introduce LayoutDreamer, a framework that leverages 3D Gaussian Splatting (3DGS) to facilitate high-quality, physically consistent compositional scene generation guided by text. Specifically, given a text prompt, we convert it into a directed scene graph and adaptively adjust the density and layout of the initial compositional 3D Gaussians. Subsequently, dynamic…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.