RoomPlanner: Explicit Layout Planner for Easier LLM-Driven 3D Room Generation

Wenzhuo Sun; Mingjian Liang; Wenxuan Song; Xuelian Cheng; Zongyuan Ge

arXiv:2511.17048 · cs.CV · November 24, 2025

TL;DR

RoomPlanner is an automatic framework that generates realistic 3D indoor scenes from short text prompts, using hierarchical language parsing, spatial arrangement constraints, and novel sampling strategies to produce high-quality, editable scenes efficiently.

Contribution

The paper introduces a fully automatic 3D room generation method that combines language-driven scene parsing, explicit layout constraints, and new sampling strategies to reduce generation time and improve scene quality.

Findings

01. Generates 3D indoor scenes in under 30 minutes.

02. Outperforms prior methods in rendering speed and visual quality.

03. Produces geometrically rational and editable scenes.

Abstract

In this paper, we propose RoomPlanner, the first fully automatic 3D room generation framework for painlessly creating realistic indoor scenes with only short text as input. Without any manual layout design or panoramic image guidance, our framework can generate explicit layout criteria for rational spatial placement. We begin by introducing a hierarchical structure of language-driven agent planners that can automatically parse short and ambiguous prompts into detailed scene descriptions. These descriptions include raw spatial and semantic attributes for each object and the background, which are then used to initialize 3D point clouds. To position objects within bounded environments, we implement two arrangement constraints that iteratively optimize spatial arrangements, ensuring a collision-free and accessible layout solution. In the final rendering stage, we propose a novel AnyReach…
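
The abstract does not spell out the two arrangement constraints, so the following is only a minimal sketch of what "collision-free" and "accessible" could mean for a 2D floor plan. It is not the authors' implementation. The `Box` type, the axis-aligned footprints, the grid resolution `cell`, and the flood-fill reachability test are all assumptions made for illustration.

```python
from collections import deque
from dataclasses import dataclass


@dataclass
class Box:
    """Axis-aligned 2D footprint of an object (hypothetical representation)."""
    name: str
    x: float  # min-x corner
    y: float  # min-y corner
    w: float  # extent along x
    d: float  # extent along y


def overlaps(a: Box, b: Box) -> bool:
    """True if the interiors of two footprints intersect (touching edges are allowed)."""
    return (a.x < b.x + b.w and b.x < a.x + a.w
            and a.y < b.y + b.d and b.y < a.y + a.d)


def collision_free(boxes, room_w, room_d) -> bool:
    """Every footprint lies inside the room and no two footprints overlap."""
    inside = all(0 <= b.x and b.x + b.w <= room_w
                 and 0 <= b.y and b.y + b.d <= room_d for b in boxes)
    disjoint = all(not overlaps(a, b)
                   for i, a in enumerate(boxes) for b in boxes[i + 1:])
    return inside and disjoint


def accessible(boxes, room_w, room_d, start, cell=0.25) -> bool:
    """Flood-fill free floor cells from `start`; every object must border a reached cell."""
    nx, ny = int(round(room_w / cell)), int(round(room_d / cell))
    # owner[i][j] is the index of the box covering the cell centre, or -1 if free.
    owner = [[-1] * ny for _ in range(nx)]
    for i in range(nx):
        for j in range(ny):
            cx, cy = (i + 0.5) * cell, (j + 0.5) * cell
            for k, b in enumerate(boxes):
                if b.x <= cx < b.x + b.w and b.y <= cy < b.y + b.d:
                    owner[i][j] = k
                    break
    si, sj = int(start[0] / cell), int(start[1] / cell)
    if not (0 <= si < nx and 0 <= sj < ny) or owner[si][sj] != -1:
        return False  # the start point itself is outside the room or blocked
    seen = {(si, sj)}
    touched = set()  # indices of boxes adjacent to reachable floor
    queue = deque([(si, sj)])
    while queue:
        i, j = queue.popleft()
        for ni, nj in ((i + 1, j), (i - 1, j), (i, j + 1), (i, j - 1)):
            if not (0 <= ni < nx and 0 <= nj < ny):
                continue
            if owner[ni][nj] != -1:
                touched.add(owner[ni][nj])
            elif (ni, nj) not in seen:
                seen.add((ni, nj))
                queue.append((ni, nj))
    return touched == set(range(len(boxes)))
```

An iterative optimizer of the kind the abstract mentions could use checks like these as its acceptance test, perturbing object positions until both return true. The optimization loop itself is left out here because the source gives no detail on it.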

Taxonomy

Topics: 3D Shape Modeling and Analysis · Robotics and Sensor-Based Localization · Human Motion and Animation