Open-Universe Indoor Scene Generation using LLM Program Synthesis and Uncurated Object Databases
Rio Aguina-Kang, Maxim Gumin, Do Heon Han, Stewart Morris, Seung Jean, Yoo, Aditya Ganeshan, R. Kenny Jones, Qiuhong Anna Wei, Kailiang Fu, Daniel, Ritchie

TL;DR
This paper introduces a novel indoor scene generation system that uses large language models to synthesize scene descriptions and retrieve unannotated 3D object meshes, enabling flexible, open-universe scene creation without extensive 3D training data.
Contribution
The system leverages LLMs for program synthesis and vision-language models for mesh retrieval, allowing open-universe indoor scene generation without large 3D datasets.
Findings
Outperforms traditional 3D generative models on scene quality.
Surpasses recent LLM-based layout methods in open-universe scenarios.
Effective retrieval of unannotated meshes from large databases.
Abstract
We present a system for generating indoor scenes in response to text prompts. The prompts are not limited to a fixed vocabulary of scene descriptions, and the objects in generated scenes are not restricted to a fixed set of object categories -- we call this setting indoor scene generation. Unlike most prior work on indoor scene generation, our system does not require a large training dataset of existing 3D scenes. Instead, it leverages the world knowledge encoded in pre-trained large language models (LLMs) to synthesize programs in a domain-specific layout language that describe objects and spatial relations between them. Executing such a program produces a specification of a constraint satisfaction problem, which the system solves using a gradient-based optimization scheme to produce object positions and orientations. To produce object geometry, the system retrieves 3D meshes from a…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsRobotics and Sensor-Based Localization · 3D Modeling in Geospatial Applications · 3D Surveying and Cultural Heritage
MethodsSparse Evolutionary Training
