SPREAD: Spatial-Physical REasoning via geometry Aware Diffusion

Minzhang Li; Kuixiang Shao; Xuebing Li; Yuyang Jiao; Yinuo Bai; Hengan Zhou; Sixian Shen; Jiayuan Gu; Jingyi Yu

arXiv:2603.27573·cs.GR·March 31, 2026

SPREAD: Spatial-Physical REasoning via geometry Aware Diffusion

Minzhang Li, Kuixiang Shao, Xuebing Li, Yuyang Jiao, Yinuo Bai, Hengan Zhou, Sixian Shen, Jiayuan Gu, Jingyi Yu

PDF

TL;DR

SPREAD is a diffusion-based framework that models spatial and physical relationships in 3D scene generation, ensuring realistic, collision-free, and physics-coherent environments for AI applications.

Contribution

It introduces a geometry-aware diffusion model with differentiable guidance for physical constraints, advancing the realism and stability of generated 3D scenes.

Findings

01

Achieves state-of-the-art performance in spatial-relational reasoning.

02

Outperforms baselines in scene consistency and stability during physics simulation.

03

Generates simulation-ready environments for embodied AI.

Abstract

Automated 3D scene generation is pivotal for applications spanning virtual reality, digital content creation, and Embodied AI. While computer graphics prioritizes aesthetic layouts, vision and robotics demand scenes that mirror real-world complexity which current data-driven methods struggle to achieve due to limited unstructured training data and insufficient spatial and physical modeling. We propose SPREAD, a diffusion-based framework that jointly learns spatial and physical relationships through a graph transformer, explicitly conditioning on posed scene point clouds for geometric awareness. Moreover, our model integrates differentiable guidance for collision avoidance, relational constraint, and gravity, ensuring physically coherent scenes without sacrificing relational context. Our experiments on 3D-FRONT and ProcTHOR datasets demonstrate state-of-the-art performance in…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.