Kinematics-Aware Latent World Models for Data-Efficient Autonomous Driving
Jiazhuo Li, Linjiang Cao, Qi Liu, and Xi Xiong

TL;DR
This paper introduces a kinematics-aware latent world model for autonomous driving that improves data efficiency and policy stability by embedding vehicle motion dynamics and spatial structure into the model.
Contribution
It extends the Recurrent State-Space Model by incorporating kinematic information and geometry-aware supervision, enhancing latent dynamics for better driving policy learning.
Findings
Improved sample efficiency over baseline models.
Enhanced long-horizon imagination fidelity.
Better spatial representation in latent space.
Abstract
Data-efficient learning remains a central challenge in autonomous driving due to the high cost and safety risks of large-scale real-world interaction. Although world-model-based reinforcement learning enables policy optimization through latent imagination, existing approaches often lack explicit mechanisms to encode spatial and kinematic structure essential for driving tasks. In this work, we build upon the Recurrent State-Space Model (RSSM) and propose a kinematics-aware latent world model framework for autonomous driving. Vehicle kinematic information is incorporated into the observation encoder to ground latent transitions in physically meaningful motion dynamics, while geometry-aware supervision regularizes the RSSM latent state to capture task-relevant spatial structure beyond pixel reconstruction. The resulting structured latent dynamics improve long-horizon imagination fidelity…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAutonomous Vehicle Technology and Safety · Reinforcement Learning in Robotics · Generative Adversarial Networks and Image Synthesis
