DreamDrive: Generative 4D Scene Modeling from Street View Images
Jiageng Mao, Boyi Li, Boris Ivanovic, Yuxiao Chen, Yan Wang, Yurong, You, Chaowei Xiao, Danfei Xu, Marco Pavone, Yue Wang

TL;DR
DreamDrive introduces a novel 4D scene generation method combining generative and reconstruction techniques, enabling high-fidelity, 3D-consistent driving videos from street view images for autonomous driving applications.
Contribution
It proposes a hybrid Gaussian representation and neural rendering framework that synthesizes controllable, generalizable 4D driving scenes with 3D consistency from in-the-wild data.
Findings
Generates high-quality, controllable 4D scenes from street view images.
Achieves high fidelity and 3D consistency in synthesized driving videos.
Decomposes static and dynamic scene elements self-supervisedly.
Abstract
Synthesizing photo-realistic visual observations from an ego vehicle's driving trajectory is a critical step towards scalable training of self-driving models. Reconstruction-based methods create 3D scenes from driving logs and synthesize geometry-consistent driving videos through neural rendering, but their dependence on costly object annotations limits their ability to generalize to in-the-wild driving scenarios. On the other hand, generative models can synthesize action-conditioned driving videos in a more generalizable way but often struggle with maintaining 3D visual consistency. In this paper, we present DreamDrive, a 4D spatial-temporal scene generation approach that combines the merits of generation and reconstruction, to synthesize generalizable 4D driving scenes and dynamic driving videos with 3D consistency. Specifically, we leverage the generative power of video diffusion…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsRemote Sensing and LiDAR Applications · 3D Modeling in Geospatial Applications · 3D Surveying and Cultural Heritage
MethodsDiffusion
