From Imitation to Exploration: End-to-end Autonomous Driving based on World Model
Yueyuan Li, Mingyang Jiang, Songan Zhang, Wei Yuan, Chunxiang Wang,, and Ming Yang

TL;DR
This paper introduces RAMBLE, a novel end-to-end autonomous driving approach combining imitation learning and reinforcement learning with a world model, achieving state-of-the-art results in complex traffic scenarios.
Contribution
It proposes a hybrid IL-RL framework with a world model for autonomous driving, enhancing adaptability and robustness in dynamic environments.
Findings
Achieves state-of-the-art route completion on CARLA Leaderboard 1.0
Completes all scenarios on CARLA Leaderboard 2.0
Demonstrates improved handling of complex traffic scenarios
Abstract
In recent years, end-to-end autonomous driving architectures have gained increasing attention due to their advantage in avoiding error accumulation. Most existing end-to-end autonomous driving methods are based on Imitation Learning (IL), which can quickly derive driving strategies by mimicking expert behaviors. However, IL often struggles to handle scenarios outside the training dataset, especially in high-dynamic and interaction-intensive traffic environments. In contrast, Reinforcement Learning (RL)-based driving models can optimize driving decisions through interaction with the environment, improving adaptability and robustness. To leverage the strengths of both IL and RL, we propose RAMBLE, an end-to-end world model-based RL method for driving decision-making. RAMBLE extracts environmental context information from RGB images and LiDAR data through an asymmetrical variational…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTraffic control and management · Autonomous Vehicle Technology and Safety · Vehicular Ad Hoc Networks (VANETs)
MethodsEntropy Regularization · Proximal Policy Optimization · CARLA: An Open Urban Driving Simulator
