Pair2Scene: Learning Local Object Relations for Procedural Scene Generation
Xingjian Ran, Shujie Zhang, Weipeng Zhong, Li Luo, Bo Dai

TL;DR
Pair2Scene is a procedural scene generation framework that learns local object relations to create complex, plausible 3D indoor scenes beyond training data, integrating spatial reasoning with scene hierarchies.
Contribution
It introduces a novel method combining learned local rules, scene hierarchies, and physics-based algorithms for scalable, high-fidelity 3D scene generation.
Findings
Outperforms existing methods in generating complex environments
Successfully models support and functional object relations
Maintains physical and semantic plausibility in generated scenes
Abstract
Generating high-fidelity 3D indoor scenes remains a significant challenge due to data scarcity and the complexity of modeling intricate spatial relations. Current methods often struggle to scale beyond training distribution to dense scenes or rely on LLMs/VLMs that lack the ability for precise spatial reasoning. Building on top of the observation that object placement relies mainly on local dependencies instead of information-redundant global distributions, in this paper, we propose Pair2Scene, a novel procedural generation framework that integrates learned local rules with scene hierarchies and physics-based algorithms. These rules mainly capture two types of inter-object relations, namely support relations that follow physical hierarchies, and functional relations that reflect semantic links. We model these rules through a network, which estimates spatial position distributions of…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
