DRARL: Disengagement-Reason-Augmented Reinforcement Learning for Efficient Improvement of Autonomous Driving Policy

Weitao Zhou; Bo Zhang; Zhong Cao; Xiang Li; Qian Cheng; Chunyang Liu; Yaqin Zhang; Diange Yang

arXiv:2506.16720·cs.RO·June 23, 2025

DRARL: Disengagement-Reason-Augmented Reinforcement Learning for Efficient Improvement of Autonomous Driving Policy

Weitao Zhou, Bo Zhang, Zhong Cao, Xiang Li, Qian Cheng, Chunyang Liu, Yaqin Zhang, Diange Yang

PDF

Open Access

TL;DR

This paper introduces DRARL, a reinforcement learning approach that uses disengagement reasons to selectively improve autonomous driving policies, effectively handling rare and semantically similar cases without over-conservatism.

Contribution

The work proposes a novel method that identifies disengagement reasons using OOD detection and leverages this information for targeted policy enhancement in autonomous driving.

Findings

01

Accurately identifies policy-related disengagement reasons.

02

Improves policy performance on similar disengagement cases.

03

Prevents over-conservatism after policy updates.

Abstract

With the increasing presence of automated vehicles on open roads under driver supervision, disengagement cases are becoming more prevalent. While some data-driven planning systems attempt to directly utilize these disengagement cases for policy improvement, the inherent scarcity of disengagement data (often occurring as a single instances) restricts training effectiveness. Furthermore, some disengagement data should be excluded since the disengagement may not always come from the failure of driving policies, e.g. the driver may casually intervene for a while. To this end, this work proposes disengagement-reason-augmented reinforcement learning (DRARL), which enhances driving policy improvement process according to the reason of disengagement cases. Specifically, the reason of disengagement is identified by a out-of-distribution (OOD) state estimation model. When the reason doesn't…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAutonomous Vehicle Technology and Safety · Reinforcement Learning in Robotics · Transportation and Mobility Innovations