From Imitation to Exploration: End-to-end Autonomous Driving based on   World Model

Yueyuan Li; Mingyang Jiang; Songan Zhang; Wei Yuan; Chunxiang Wang,; and Ming Yang

arXiv:2410.02253·cs.AI·April 22, 2025

From Imitation to Exploration: End-to-end Autonomous Driving based on World Model

Yueyuan Li, Mingyang Jiang, Songan Zhang, Wei Yuan, Chunxiang Wang,, and Ming Yang

PDF

Open Access

TL;DR

This paper introduces RAMBLE, a novel end-to-end autonomous driving approach combining imitation learning and reinforcement learning with a world model, achieving state-of-the-art results in complex traffic scenarios.

Contribution

It proposes a hybrid IL-RL framework with a world model for autonomous driving, enhancing adaptability and robustness in dynamic environments.

Findings

01

Achieves state-of-the-art route completion on CARLA Leaderboard 1.0

02

Completes all scenarios on CARLA Leaderboard 2.0

03

Demonstrates improved handling of complex traffic scenarios

Abstract

In recent years, end-to-end autonomous driving architectures have gained increasing attention due to their advantage in avoiding error accumulation. Most existing end-to-end autonomous driving methods are based on Imitation Learning (IL), which can quickly derive driving strategies by mimicking expert behaviors. However, IL often struggles to handle scenarios outside the training dataset, especially in high-dynamic and interaction-intensive traffic environments. In contrast, Reinforcement Learning (RL)-based driving models can optimize driving decisions through interaction with the environment, improving adaptability and robustness. To leverage the strengths of both IL and RL, we propose RAMBLE, an end-to-end world model-based RL method for driving decision-making. RAMBLE extracts environmental context information from RGB images and LiDAR data through an asymmetrical variational…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTraffic control and management · Autonomous Vehicle Technology and Safety · Vehicular Ad Hoc Networks (VANETs)

MethodsEntropy Regularization · Proximal Policy Optimization · CARLA: An Open Urban Driving Simulator