WorldMesh: Generating Navigable Multi-Room 3D Scenes via Mesh-Conditioned Image Diffusion
Manuel-Andreas Schneider, Angela Dai

TL;DR
This paper introduces WorldMesh, a geometry-first method for generating large-scale, immersive 3D scenes by combining mesh-based structural scaffolds with conditioned image synthesis for realistic appearance.
Contribution
It proposes a novel mesh-conditioned image diffusion approach that decouples scene structure from appearance, enabling scalable and consistent 3D scene generation.
Findings
Enables generation of large, diverse 3D scenes with high object richness.
Maintains scene and object consistency at large scales.
Produces photorealistic, environment-scale 3D worlds.
Abstract
Recent progress in image and video synthesis has inspired their use in advancing 3D scene generation. However, we observe that text-to-image and -video approaches struggle to maintain scene- and object-level consistency beyond a limited environment scale due to the absence of explicit geometry. We thus present a geometry-first approach that decouples this complex problem of large-scale 3D scene synthesis into its structural composition, represented as a mesh scaffold, and realistic appearance synthesis, which leverages powerful image synthesis models conditioned on the mesh scaffold. From an input text description, we first construct a mesh capturing the environment's geometry (walls, floors, etc.), and then use image synthesis, segmentation and object reconstruction to populate the mesh structure with objects in realistic layouts. This mesh scaffold is then rendered to condition image…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGenerative Adversarial Networks and Image Synthesis · 3D Shape Modeling and Analysis · Computer Graphics and Visualization Techniques
