DreamScene360: Unconstrained Text-to-3D Scene Generation with Panoramic Gaussian Splatting
Shijie Zhou, Zhiwen Fan, Dejia Xu, Haoran Chang, Pradyumna Chari,, Tejas Bharadwaj, Suya You, Zhangyang Wang, Achuta Kadambi

TL;DR
DreamScene360 introduces a rapid pipeline for generating immersive 360-degree 3D scenes from text prompts, combining 2D diffusion models, depth alignment, and Gaussian splatting for real-time exploration.
Contribution
It presents a novel method that converts text into globally coherent 3D panoramic scenes using a combination of diffusion models, depth alignment, and Gaussian splatting techniques.
Findings
Generates high-quality 360° scenes in minutes
Ensures global coherence with depth and semantic constraints
Enables real-time exploration of reconstructed scenes
Abstract
The increasing demand for virtual reality applications has highlighted the significance of crafting immersive 3D assets. We present a text-to-3D 360 scene generation pipeline that facilitates the creation of comprehensive 360 scenes for in-the-wild environments in a matter of minutes. Our approach utilizes the generative power of a 2D diffusion model and prompt self-refinement to create a high-quality and globally coherent panoramic image. This image acts as a preliminary "flat" (2D) scene representation. Subsequently, it is lifted into 3D Gaussians, employing splatting techniques to enable real-time exploration. To produce consistent 3D geometry, our pipeline constructs a spatially coherent structure by aligning the 2D monocular depth into a globally optimized point cloud. This point cloud serves as the initial state for the centroids of 3D Gaussians. In order to…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsHuman Motion and Animation · Image Processing and 3D Reconstruction · Video Analysis and Summarization
MethodsDiffusion
