EchoScene: Indoor Scene Generation via Information Echo over Scene Graph Diffusion
Guangyao Zhai, Evin Pınar Örnek, Dave Zhenyu Chen, Ruotong Liao, Yan Di, Nassir Navab, Federico Tombari, Benjamin Busam

TL;DR
EchoScene introduces a novel dual-branch diffusion model with an information echo scheme for controllable, high-quality 3D indoor scene generation from scene graphs, effectively handling complex graph structures and enabling scene manipulation.
Contribution
It proposes a dual-branch diffusion framework with an information echo scheme that improves scene graph-based 3D scene generation, ensuring global coherence and controllability.
Findings
Outperforms previous methods in scene fidelity and controllability.
Generates high-quality scenes compatible with texture generation.
Enables scene editing during inference through scene graph manipulation.
Abstract
We present EchoScene, an interactive and controllable generative model that generates 3D indoor scenes on scene graphs. EchoScene leverages a dual-branch diffusion model that dynamically adapts to scene graphs. Existing methods struggle to handle scene graphs due to varying numbers of nodes, multiple edge combinations, and manipulator-induced node-edge operations. EchoScene overcomes this by associating each node with a denoising process and enables collaborative information exchange, enhancing controllable and consistent generation aware of global constraints. This is achieved through an information echo scheme in both shape and layout branches. At every denoising step, all processes share their denoising data with an information exchange unit that combines these updates using graph convolution. The scheme ensures that the denoising processes are influenced by a holistic understanding…
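The information echo described in the abstract can be illustrated with a minimal NumPy sketch. It is not the authors' implementation: the mean-aggregation graph convolution, the names `graph_conv`, `echo_step`, and `toy_denoiser`, and the feature sizes are all assumptions made for illustration, and the learned denoising network is replaced by a fixed placeholder. The sketch only shows the control flow: one denoising state per node, an exchange unit that combines all states over the graph at every step, and the combined result fed back to each node's process.

```python
import numpy as np

def graph_conv(features, adjacency):
    """Hypothetical exchange unit: average each node with its neighbours (self-loop included)."""
    a_hat = adjacency + np.eye(adjacency.shape[0])
    deg = a_hat.sum(axis=1, keepdims=True)
    return (a_hat / deg) @ features

def toy_denoiser(conditioned, t):
    """Placeholder for a learned denoiser (assumption): blends own state with graph context."""
    d = conditioned.shape[1] // 2
    own, context = conditioned[:, :d], conditioned[:, d:]
    return 0.5 * own + 0.5 * context

def echo_step(states, adjacency, denoise_fn, t):
    """One denoising step: share per-node data, aggregate it, echo it back to every node."""
    shared = graph_conv(states, adjacency)            # exchange unit combines all updates
    conditioned = np.concatenate([states, shared], axis=1)
    return denoise_fn(conditioned, t)                 # each node denoises with global context

# Toy scene graph with three nodes in a chain (0 - 1 - 2); any node count works.
adjacency = np.array([[0, 1, 0],
                      [1, 0, 1],
                      [0, 1, 0]], dtype=float)

rng = np.random.default_rng(0)
init = rng.normal(size=(3, 4))                        # one noisy 4-d state per node
states = init.copy()
for t in reversed(range(10)):                         # run the denoising steps in reverse time
    states = echo_step(states, adjacency, toy_denoiser, t)
```

Because the adjacency matrix is an input to every step, adding or removing a node or edge only changes the matrix and the number of rows in `states`, which is one way to picture how a per-node design can accommodate graphs of varying size.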
Taxonomy
Topics: Advanced Image and Video Retrieval Techniques · Video Surveillance and Tracking Methods · Image Retrieval and Classification Techniques
Methods: Contrastive Language-Image Pre-training · Graph Convolutional Network · Diffusion
