R$^3$L: Reasoning 3D Layouts from Relative Spatial Relations

Zhifeng Gu; Yuqi Wang; Bing Wang

arXiv:2605.06758·cs.CV·May 20, 2026

TL;DR

R3L introduces a framework that enhances the reliability of 3D layout generation by addressing errors in multi-hop relative spatial reasoning through invariant decomposition, self-consistency, and spatial optimization.

Contribution

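The paper proposes two methods to improve multi-hop reasoning in 3D layout generation: invariant spatial decomposition, which breaks coupled relation chains, and consistent spatial imagination, an imagine-and-revise loop that promotes self-consistency.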

Findings

01 Produces more physically feasible layouts

02 Achieves higher semantic consistency

03 Addresses frame-induced reasoning errors

Abstract

Relative spatial relations provide a compact representation of spatial structure and are fundamental to relative spatial reasoning in 3D layout generation. Recent works leverage Multimodal Large Language Models (MLLMs) to infer such relations, but the inferred relations are often unreliable and are typically handled with post-hoc heuristics. In this paper, we propose R$^3$L, a general framework that improves the reliability and consistency of relative spatial reasoning for 3D layout generation. Our key motivation is that multi-hop reasoning requires repeated reference-frame transformations, which accumulate errors in inferred relations and lead to semantic and metric drift. To mitigate this, we propose invariant spatial decomposition to break coupled relation chains, and consistent spatial imagination to promote self-consistency through an imagine-and-revise loop. We further introduce…
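The motivation stated in the abstract, that repeated reference-frame transformations accumulate error across hops, can be illustrated with a toy 2D sketch. This is not the R$^3$L method and does not reflect the authors' code. The function names (`compose`, `chain_end`, `drift`), the unit offsets, and the fixed per-hop heading error are all hypothetical choices made for illustration. Each object is placed "1 unit in front of" the previous one, and a small orientation error at every hop bends the whole chain.

```python
import math

def compose(pose, rel, heading_error=0.0):
    """Place a child object from its parent's world pose and a relative offset.

    pose: (x, y, theta) of the parent in the world frame.
    rel:  (dx, dy, dtheta) of the child expressed in the parent's frame.
    heading_error: hypothetical error (radians) in the child's inferred heading.
    """
    x, y, theta = pose
    dx, dy, dtheta = rel
    # Rotate the relative offset into the world frame, then translate.
    wx = x + dx * math.cos(theta) - dy * math.sin(theta)
    wy = y + dx * math.sin(theta) + dy * math.cos(theta)
    return (wx, wy, theta + dtheta + heading_error)

def chain_end(n_hops, heading_error):
    """World pose of the last object in a chain of 'in front of' relations."""
    pose = (0.0, 0.0, 0.0)  # anchor object at the world origin
    for _ in range(n_hops):
        pose = compose(pose, (1.0, 0.0, 0.0), heading_error)
    return pose

def drift(n_hops, heading_error):
    """Distance between the erroneous chain end and the error-free chain end."""
    ex, ey, _ = chain_end(n_hops, 0.0)
    ax, ay, _ = chain_end(n_hops, heading_error)
    return math.hypot(ax - ex, ay - ey)

if __name__ == "__main__":
    for hops in (1, 2, 4, 6):
        print(hops, round(drift(hops, 0.1), 3))
```

In this toy setup the first hop has no positional drift, because the heading error only affects objects placed relative to the mis-oriented one. Every later hop inherits all earlier heading errors, so the metric drift grows with chain length. This is the kind of coupling that, according to the abstract, invariant spatial decomposition is meant to break.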

Code & Models

Repositories

Neal2020GitHub/R3L (GitHub)
