Sat2Scene: 3D Urban Scene Generation from Satellite Images with Diffusion
Zuoyue Li, Zhenqiang Li, Zhaopeng Cui, Marc Pollefeys, Martin R., Oswald

TL;DR
This paper introduces Sat2Scene, a novel 3D scene generation method from satellite images using diffusion models and neural rendering, enabling realistic multi-view urban scene synthesis.
Contribution
It presents a new architecture combining diffusion models with neural rendering for direct 3D scene generation from satellite imagery, addressing view change and scale challenges.
Findings
Generates photo-realistic street-view sequences.
Produces consistent multi-view urban scenes.
Effective on city-scale datasets.
Abstract
Directly generating scenes from satellite imagery offers exciting possibilities for integration into applications like games and map services. However, challenges arise from significant view changes and scene scale. Previous efforts mainly focused on image or video generation, lacking exploration into the adaptability of scene generation for arbitrary views. Existing 3D generation works either operate at the object level or are difficult to utilize the geometry obtained from satellite imagery. To overcome these limitations, we propose a novel architecture for direct 3D scene generation by introducing diffusion models into 3D sparse representations and combining them with neural rendering techniques. Specifically, our approach generates texture colors at the point level for a given geometry using a 3D diffusion model first, which is then transformed into a scene representation in a…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Vision and Imaging · Remote Sensing and LiDAR Applications · Computer Graphics and Visualization Techniques
MethodsDiffusion
