A Survey of World Models for Autonomous Driving
Tuo Feng, Wenguan Wang, Yi Yang

TL;DR
This survey reviews recent advances in world models for autonomous driving, highlighting their role in environment understanding, behavior planning, and multi-agent interaction, and discusses future research directions for deployment in complex urban scenarios.
Contribution
It provides a comprehensive taxonomy and analysis of recent world modeling techniques, training paradigms, and challenges specific to autonomous driving.
Findings
Diffusion models improve scene evolution prediction.
Multi-agent collaboration enhances decision-making.
Self-supervised learning boosts model robustness.
Abstract
Recent breakthroughs in autonomous driving have been propelled by advances in robust world modeling, fundamentally transforming how vehicles interpret dynamic scenes and execute safe decision-making. World models have emerged as a linchpin technology, offering high-fidelity representations of the driving environment that integrate multi-sensor data, semantic cues, and temporal dynamics. This paper systematically reviews recent advances in world models for autonomous driving, proposing a three-tiered taxonomy: (i) Generation of Future Physical World, covering Image-, BEV-, OG-, and PC-based generation methods that enhance scene evolution modeling through diffusion models and 4D occupancy forecasting; (ii) Behavior Planning for Intelligent Agents, combining rule-driven and learning-based paradigms with cost map optimization and reinforcement learning for trajectory generation in complex…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAutonomous Vehicle Technology and Safety · Transportation and Mobility Innovations
MethodsDiffusion · Focus
