LiDARCrafter: Dynamic 4D World Modeling from LiDAR Sequences
Ao Liang, Youquan Liu, Yu Yang, Dongyue Lu, Linfeng Li, Lingdong Kong, Huaici Zhao, Wei Tsang Ooi

TL;DR
LiDARCrafter is a novel framework that generates and edits dynamic 4D LiDAR sequences from natural language instructions, enabling high-fidelity, controllable, and temporally coherent scene modeling for autonomous driving applications.
Contribution
It introduces a unified diffusion-based model for 4D LiDAR generation and editing conditioned on scene graphs, along with a benchmark for standardized evaluation.
Findings
Achieves state-of-the-art fidelity and controllability in LiDAR sequence generation.
Ensures high temporal coherence with smooth transitions in generated sequences.
Provides a comprehensive benchmark for evaluating 4D LiDAR generation methods.
Abstract
Generative world models have become essential data engines for autonomous driving, yet most existing efforts focus on videos or occupancy grids, overlooking the unique LiDAR properties. Extending LiDAR generation to dynamic 4D world modeling presents challenges in controllability, temporal coherence, and evaluation standardization. To this end, we present LiDARCrafter, a unified framework for 4D LiDAR generation and editing. Given free-form natural language inputs, we parse instructions into ego-centric scene graphs, which condition a tri-branch diffusion network to generate object structures, motion trajectories, and geometry. These structured conditions enable diverse and fine-grained scene editing. Additionally, an autoregressive module generates temporally coherent 4D LiDAR sequences with smooth transitions. To support standardized evaluation, we establish a comprehensive benchmark…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Taxonomy
TopicsGenerative Adversarial Networks and Image Synthesis · Multimodal Machine Learning Applications · 3D Shape Modeling and Analysis
