Raw2Drive: Reinforcement Learning with Aligned World Models for End-to-End Autonomous Driving (in CARLA v2)

Zhenjie Yang; Xiaosong Jia; Qifeng Li; Xue Yang; Maoqing Yao; Junchi Yan

arXiv:2505.16394·cs.RO·October 28, 2025

Raw2Drive: Reinforcement Learning with Aligned World Models for End-to-End Autonomous Driving (in CARLA v2)

Zhenjie Yang, Xiaosong Jia, Qifeng Li, Xue Yang, Maoqing Yao, Junchi Yan

PDF

Open Access

TL;DR

Raw2Drive introduces a novel reinforcement learning framework for end-to-end autonomous driving that effectively utilizes aligned world models trained on privileged and raw sensor data, achieving state-of-the-art results in CARLA benchmarks.

Contribution

The paper presents Raw2Drive, a dual-stream model-based RL approach that bridges privileged and raw sensor data for autonomous driving, a novel method in this domain.

Findings

01

Achieves state-of-the-art performance on CARLA Leaderboard 2.0

02

First RL-based end-to-end autonomous driving method on CARLA benchmarks

03

Effectively aligns privileged and raw sensor world models during training

Abstract

Reinforcement Learning (RL) can mitigate the causal confusion and distribution shift inherent to imitation learning (IL). However, applying RL to end-to-end autonomous driving (E2E-AD) remains an open problem for its training difficulty, and IL is still the mainstream paradigm in both academia and industry. Recently Model-based Reinforcement Learning (MBRL) have demonstrated promising results in neural planning; however, these methods typically require privileged information as input rather than raw sensor data. We fill this gap by designing Raw2Drive, a dual-stream MBRL approach. Initially, we efficiently train an auxiliary privileged world model paired with a neural planner that uses privileged information as input. Subsequently, we introduce a raw sensor world model trained via our proposed Guidance Mechanism, which ensures consistency between the raw sensor world model and the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAutonomous Vehicle Technology and Safety · Reinforcement Learning in Robotics · Adversarial Robustness in Machine Learning

MethodsEntropy Regularization · Proximal Policy Optimization · CARLA: An Open Urban Driving Simulator