Exploring the Interplay Between Video Generation and World Models in Autonomous Driving: A Survey
Ao Fu, Yi Zhou, Tao Zhou, Yi Yang, Bojun Gao, Qun Li, Guobin Wu, Ling Shao

TL;DR
This survey explores how the integration of world models and video generation, especially diffusion-based models, can enhance autonomous driving by improving simulation accuracy, situational awareness, and decision-making, while highlighting current challenges and future directions.
Contribution
It provides a comprehensive analysis of the relationship between world models and video generation in autonomous driving, emphasizing diffusion models and evaluating key metrics and approaches.
Findings
Diffusion-based models show promise in simulating driving scenarios.
Key evaluation metrics include Chamfer distance and Fréchet Inception Distance (FID).
Diverse interpretations of world models highlight the field's evolving understanding.
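As a rough illustration of one metric named in the findings, the sketch below computes a symmetric Chamfer distance between two point sets, such as a predicted and a reference point cloud. It is a minimal sketch, not the survey's evaluation code: the function name `chamfer_distance` is hypothetical, and the choice of squared Euclidean distances averaged in each direction is an assumption, since several Chamfer variants are in use.

```python
from math import dist


def chamfer_distance(a, b):
    """Symmetric Chamfer distance between two non-empty point sets.

    Assumed convention: squared Euclidean distance to the nearest
    neighbour, averaged per set, then summed over both directions.
    """
    if not a or not b:
        raise ValueError("both point sets must be non-empty")
    # Mean squared distance from each point in a to its nearest point in b.
    a_to_b = sum(min(dist(p, q) ** 2 for q in b) for p in a) / len(a)
    # Mean squared distance from each point in b to its nearest point in a.
    b_to_a = sum(min(dist(q, p) ** 2 for p in a) for q in b) / len(b)
    return a_to_b + b_to_a


if __name__ == "__main__":
    predicted = [(0.0, 0.0, 0.0), (1.0, 0.0, 0.0)]
    reference = [(0.0, 0.0, 0.0)]
    print(chamfer_distance(predicted, reference))
```

This brute-force version compares every pair of points, so it is only suitable for small sets; lower values indicate that the two sets lie closer together.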
Abstract
World models and video generation are pivotal technologies in the domain of autonomous driving, each playing a critical role in enhancing the robustness and reliability of autonomous systems. World models, which simulate the dynamics of real-world environments, and video generation models, which produce realistic video sequences, are increasingly being integrated to improve situational awareness and decision-making capabilities in autonomous vehicles. This paper investigates the relationship between these two technologies, focusing on how their structural parallels, particularly in diffusion-based models, contribute to more accurate and coherent simulations of driving scenarios. We examine leading works such as JEPA, Genie, and Sora, which exemplify different approaches to world model design, thereby highlighting the lack of a universally accepted definition of world models. These…
Taxonomy
Topics: Autonomous Vehicle Technology and Safety
