MMGDreamer: Mixed-Modality Graph for Geometry-Controllable 3D Indoor   Scene Generation

Zhifei Yang; Keyang Lu; Chao Zhang; Jiaxing Qi; Hanqi Jiang; Ruifei; Ma; Shenglin Yin; Yifan Xu; Mingzhe Xing; Zhen Xiao; Jieyi Long; Guangyao; Zhai

arXiv:2502.05874·cs.CV·March 27, 2025

MMGDreamer: Mixed-Modality Graph for Geometry-Controllable 3D Indoor Scene Generation

Zhifei Yang, Keyang Lu, Chao Zhang, Jiaxing Qi, Hanqi Jiang, Ruifei, Ma, Shenglin Yin, Yifan Xu, Mingzhe Xing, Zhen Xiao, Jieyi Long, Guangyao, Zhai

PDF

Open Access 1 Repo 1 Video

TL;DR

MMGDreamer introduces a dual-branch diffusion model with a mixed-modality graph to enable precise, flexible, and high-fidelity 3D indoor scene generation with controllable object geometry.

Contribution

It proposes a novel mixed-modality graph and visual enhancement modules for improved control and adaptability in 3D scene generation.

Findings

01

Achieves state-of-the-art scene generation performance.

02

Demonstrates superior control over object geometry.

03

Enables flexible user input integration.

Abstract

Controllable 3D scene generation has extensive applications in virtual reality and interior design, where the generated scenes should exhibit high levels of realism and controllability in terms of geometry. Scene graphs provide a suitable data representation that facilitates these applications. However, current graph-based methods for scene generation are constrained to text-based inputs and exhibit insufficient adaptability to flexible user inputs, hindering the ability to precisely control object geometry. To address this issue, we propose MMGDreamer, a dual-branch diffusion model for scene generation that incorporates a novel Mixed-Modality Graph, visual enhancement module, and relation predictor. The mixed-modality graph allows object nodes to integrate textual and visual modalities, with optional relationships between nodes. It enhances adaptability to flexible user inputs and…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

yangzhifeio/MMGDreamer
pytorchOfficial

Videos

MMGDreamer: Mixed-Modality Graph for Geometry-Controllable 3D Indoor Scene Generation· underline

Taxonomy

Topics3D Surveying and Cultural Heritage · 3D Shape Modeling and Analysis · Computer Graphics and Visualization Techniques

MethodsDiffusion