D$^2$-World: An Efficient World Model through Decoupled Dynamic Flow
Haiming Zhang, Xu Yan, Ying Xue, Zixuan Guo, Shuguang Cui, Zhen Li,, Bingbing Liu

TL;DR
D$^2$-World is an efficient, decoupled dynamic flow-based world model that predicts future point clouds, achieving state-of-the-art results and significantly faster training times for autonomous systems.
Contribution
We propose D$^2$-World, a novel decoupled dynamic flow approach for efficient and accurate future point cloud prediction in world modeling.
Findings
Achieves state-of-the-art performance on the OpenScene benchmark.
Trains over 300% faster than baseline models.
Effectively predicts future dynamic and static voxels.
Abstract
This technical report summarizes the second-place solution for the Predictive World Model Challenge held at the CVPR-2024 Workshop on Foundation Models for Autonomous Systems. We introduce D-World, a novel World model that effectively forecasts future point clouds through Decoupled Dynamic flow. Specifically, the past semantic occupancies are obtained via existing occupancy networks (e.g., BEVDet). Following this, the occupancy results serve as the input for a single-stage world model, generating future occupancy in a non-autoregressive manner. To further simplify the task, dynamic voxel decoupling is performed in the world model. The model generates future dynamic voxels by warping the existing observations through voxel flow, while remaining static voxels can be easily obtained through pose transformation. As a result, our approach achieves state-of-the-art performance on the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsComputability, Logic, AI Algorithms · Algorithms and Data Compression · Advanced Database Systems and Queries
