OptiScene: LLM-driven Indoor Scene Layout Generation via Scaled Human-aligned Data Synthesis and Multi-Stage Preference Optimization
Yixuan Yang, Zhen Luo, Tongsheng Ding, Junru Lu, Mingqi Gao, Jinyu Yang, Victor Sanchez, Feng Zheng

TL;DR
This paper introduces OptiScene, an LLM fine-tuned on a large synthetic indoor layout dataset, achieving improved generation quality and human preference alignment for diverse room types.
Contribution
The paper presents a new large-scale synthetic dataset, 3D-SynthPlace, and a fine-tuned LLM, OptiScene, with a two-stage training process including preference optimization, advancing indoor scene layout generation.
Findings
OptiScene outperforms existing prompt-driven and learning-based methods.
The dataset covers four common room types with diverse object arrangements.
OptiScene demonstrates effectiveness in scene editing and robot navigation tasks.
Abstract
Automatic indoor layout generation has attracted increasing attention due to its potential in interior design, virtual environment construction, and embodied AI. Existing methods fall into two categories: prompt-driven approaches that leverage proprietary LLM services (e.g., GPT APIs) and learning-based methods trained on layout data upon diffusion-based models. Prompt-driven methods often suffer from spatial inconsistency and high computational costs, while learning-based methods are typically constrained by coarse relational graphs and limited datasets, restricting their generalization to diverse room categories. In this paper, we revisit LLM-based indoor layout generation and present 3D-SynthPlace, a large-scale dataset that combines synthetic layouts generated via a 'GPT synthesize, Human inspect' pipeline, upgraded from the 3D-Front dataset. 3D-SynthPlace contains nearly 17,000…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
Topics3D Surveying and Cultural Heritage · Remote Sensing and LiDAR Applications · Video Surveillance and Tracking Methods
MethodsRefunds@Expedia|||How do I get a full refund from Expedia? · Cosine Annealing · Linear Layer · Dense Connections · Layer Normalization · Adam · Attention Is All You Need · ADaptive gradient method with the OPTimal convergence rate · Linear Warmup With Cosine Annealing · Attention Dropout
