COME: Adding Scene-Centric Forecasting Control to Occupancy World Model
Yining Shi, Kun Jiang, Qiang Meng, Ke Wang, Jiabao Wang, Wenchao Sun, Tuopu Wen, Mengmeng Yang, Diange Yang

TL;DR
COME introduces a scene-centric forecasting control framework that improves occupancy prediction accuracy by disentangling ego-motion from environmental changes, leading to significant performance gains on the nuScenes-Occ3D dataset.
Contribution
The paper presents a novel scene-centric approach that integrates forecasting control into the occupancy world model, enhancing prediction accuracy and controllability.
Findings
Achieves 26.3% better mIoU than DOME.
Achieves 23.7% better mIoU than UniScene.
Demonstrates consistent improvements across various configurations.
Abstract
World models are critical for autonomous driving to simulate environmental dynamics and generate synthetic data. Existing methods struggle to disentangle ego-vehicle motion (perspective shifts) from scene evolvement (agent interactions), leading to suboptimal predictions. Instead, we propose to separate environmental changes from ego-motion by leveraging the scene-centric coordinate systems. In this paper, we introduce COME: a framework that integrates scene-centric forecasting Control into the Occupancy world ModEl. Specifically, COME first generates ego-irrelevant, spatially consistent future features through a scene-centric prediction branch, which are then converted into scene condition using a tailored ControlNet. These condition features are subsequently injected into the occupancy world model, enabling more accurate and controllable future occupancy predictions. Experimental…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Taxonomy
TopicsData Quality and Management · Advanced Database Systems and Queries · Data Management and Algorithms
