Reinforced Refinement with Self-Aware Expansion for End-to-End Autonomous Driving

Haochen Liu; Tianyu Li; Haohan Yang; Li Chen; Caojun Wang; Ke Guo; Haochen Tian; Hongchen Li; Hongyang Li; Chen Lv

arXiv:2506.09800·cs.RO·June 12, 2025

Reinforced Refinement with Self-Aware Expansion for End-to-End Autonomous Driving

Haochen Liu, Tianyu Li, Haohan Yang, Li Chen, Caojun Wang, Ke Guo, Haochen Tian, Hongchen Li, Hongyang Li, Chen Lv

PDF

Open Access 1 Models

TL;DR

This paper introduces R2SE, a reinforcement learning-based pipeline that refines autonomous driving models by targeting hard cases, improving generalization, safety, and robustness in end-to-end driving systems.

Contribution

The paper proposes a novel reinforcement learning framework with dynamic policy expansion and targeted refinement to enhance end-to-end autonomous driving models.

Findings

01

Improved generalization and safety in simulation and real-world datasets.

02

Enhanced robustness of long-horizon driving policies.

03

State-of-the-art performance over existing end-to-end systems.

Abstract

End-to-end autonomous driving has emerged as a promising paradigm for directly mapping sensor inputs to planning maneuvers using learning-based modular integrations. However, existing imitation learning (IL)-based models suffer from generalization to hard cases, and a lack of corrective feedback loop under post-deployment. While reinforcement learning (RL) offers a potential solution to tackle hard cases with optimality, it is often hindered by overfitting to specific driving cases, resulting in catastrophic forgetting of generalizable knowledge and sample inefficiency. To overcome these challenges, we propose Reinforced Refinement with Self-aware Expansion (R2SE), a novel learning pipeline that constantly refines hard domain while keeping generalizable driving policy for model-agnostic end-to-end driving systems. Through reinforcement fine-tuning and policy expansion that facilitates…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Models

🤗
unknownuser6666/R2SE_model
model

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsReinforcement Learning in Robotics · Autonomous Vehicle Technology and Safety · Robot Manipulation and Learning