OccSim: Multi-kilometer Simulation with Long-horizon Occupancy World Models
Tianran Liu, Shengwen Zhao, Mozhgan Pourkeshavarz, Weican Li, Nicholas Rhinehart

TL;DR
OccSim is a novel occupancy world model-driven 3D simulator capable of generating over 4 kilometers of continuous, diverse simulation streams without relying on pre-recorded logs or HD maps, significantly enhancing scalability.
Contribution
This work introduces OccSim, the first occupancy world model-based simulator that enables long-horizon, large-scale 3D environment generation for autonomous driving simulation.
Findings
Achieves over 80x longer stable generation than previous models.
Pre-training with OccSim data improves zero-shot forecasting performance by up to 74%.
Scaling OccSim dataset further enhances model performance and diversity.
Abstract
Data-driven autonomous driving simulation has long been constrained by its heavy reliance on pre-recorded driving logs or spatial priors, such as HD maps. This fundamental dependency severely limits scalability, restricting open-ended generation capabilities to the finite scale of existing collected datasets. To break this bottleneck, we present OccSim, the first occupancy world model-driven 3D simulator. OccSim obviates the requirement for continuous logs or HD maps; conditioned only on a single initial frame and a sequence of future ego-actions, it can stably generate over 3,000 continuous frames, enabling the continuous construction of large-scale 3D occupancy maps spanning over 4 kilometers for simulation. This represents an >80x improvement in stable generation length over previous state-of-the-art occupancy world models. OccSim is powered by two modules: W-DiT based static…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
