DreamScene: 3D Gaussian-based End-to-end Text-to-3D Scene Generation

Haoran Li; Yuli Tian; Kun Lan; Yong Liao; Lin Wang; Pan Hui; Peng Yuan Zhou

arXiv:2507.13985·cs.CV·July 30, 2025

DreamScene: 3D Gaussian-based End-to-end Text-to-3D Scene Generation

Haoran Li, Yuli Tian, Kun Lan, Yong Liao, Lin Wang, Pan Hui, Peng Yuan Zhou

PDF

TL;DR

DreamScene is an end-to-end framework that generates high-quality, editable 3D scenes from text, combining scene planning, layout, geometry synthesis, and editing for applications in gaming, film, and design.

Contribution

It introduces a novel, fully automated pipeline for text-to-3D scene generation that improves quality, consistency, and editing capabilities over prior methods.

Findings

01

Outperforms previous methods in quality and consistency

02

Supports fine-grained scene editing and dynamic motion

03

Provides a practical solution for open-domain 3D content creation

Abstract

Generating 3D scenes from natural language holds great promise for applications in gaming, film, and design. However, existing methods struggle with automation, 3D consistency, and fine-grained control. We present DreamScene, an end-to-end framework for high-quality and editable 3D scene generation from text or dialogue. DreamScene begins with a scene planning module, where a GPT-4 agent infers object semantics and spatial constraints to construct a hybrid graph. A graph-based placement algorithm then produces a structured, collision-free layout. Based on this layout, Formation Pattern Sampling (FPS) generates object geometry using multi-timestep sampling and reconstructive optimization, enabling fast and realistic synthesis. To ensure global consistent, DreamScene employs a progressive camera sampling strategy tailored to both indoor and outdoor settings. Finally, the system supports…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.