Scaling Up Occupancy-centric Driving Scene Generation: Dataset and Method
Bohan Li, Xin Jin, Hu Zhu, Hongsi Liu, Ruikai Li, Jiazhe Guo, Kaiwen Cai, Chao Ma, Yueming Jin, Hao Zhao, Xiaokang Yang, Wenjun Zeng

TL;DR
This paper introduces Nuplan-Occ, the largest semantic occupancy dataset for autonomous driving, and proposes a unified framework for high-fidelity, multi-modal scene generation that advances scalability and practical application.
Contribution
It presents a large-scale occupancy dataset and a novel unified generative framework with techniques for multi-view, LiDAR, and dynamic scene synthesis.
Findings
Achieves superior generation fidelity and scalability.
Validates effectiveness in downstream autonomous driving tasks.
Introduces novel rendering and sensor-aware embedding techniques.
Abstract
Driving scene generation is a critical domain for autonomous driving, enabling downstream applications, including perception and planning evaluation. Occupancy-centric methods have recently achieved state-of-the-art results by offering consistent conditioning across frames and modalities; however, their performance heavily depends on annotated occupancy data, which still remains scarce. To overcome this limitation, we curate Nuplan-Occ, the largest semantic occupancy dataset to date, constructed from the widely used Nuplan benchmark. Its scale and diversity facilitate not only large-scale generative modeling but also autonomous driving downstream applications. Based on this dataset, we develop a unified framework that jointly synthesizes high-quality semantic occupancy, multi-view videos, and LiDAR point clouds. Our approach incorporates a spatio-temporal disentangled architecture to…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
