Mixed Diffusion for 3D Indoor Scene Synthesis
Siyi Hu, Diego Martin Arroyo, Stephanie Debats, Fabian, Manhardt, Luca Carlone, Federico Tombari

TL;DR
This paper introduces MiDiffusion, a novel mixed discrete-continuous diffusion model that synthesizes realistic 3D indoor scenes from floor plans, outperforming existing methods and enabling flexible scene completion.
Contribution
The paper presents MiDiffusion, the first diffusion model tailored for floor-conditioned 3D indoor scene synthesis with mixed discrete and continuous data handling.
Findings
Outperforms state-of-the-art autoregressive and diffusion models on 3D-FRONT dataset.
Effectively handles partial object constraints without task-specific training.
Demonstrates advantages in scene completion and furniture arrangement tasks.
Abstract
Generating realistic 3D scenes is an area of growing interest in computer vision and robotics. However, creating high-quality, diverse synthetic 3D content often requires expert intervention, making it costly and complex. Recently, efforts to automate this process with learning techniques, particularly diffusion models, have shown significant improvements in tasks like furniture rearrangement. However, applying diffusion models to floor-conditioned indoor scene synthesis remains under-explored. This task is especially challenging as it requires arranging objects in continuous space while selecting from discrete object categories, posing unique difficulties for conventional diffusion methods. To bridge this gap, we present MiDiffusion, a novel mixed discrete-continuous diffusion model designed to synthesize plausible 3D indoor scenes given a floor plan and pre-arranged objects. We…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Vision and Imaging · Computer Graphics and Visualization Techniques · 3D Surveying and Cultural Heritage
MethodsSparse Evolutionary Training · Diffusion
