Self-Improving Safety Performance of Reinforcement Learning Based Driving with Black-Box Verification Algorithms

Resul Dagdanov; Halil Durmus; Nazim Kemal Ure

arXiv:2210.16575·cs.AI·July 21, 2025

Self-Improving Safety Performance of Reinforcement Learning Based Driving with Black-Box Verification Algorithms

Resul Dagdanov, Halil Durmus, Nazim Kemal Ure

PDF

Open Access 2 Repos

TL;DR

This paper introduces a self-improving framework for reinforcement learning-based autonomous driving that uses black-box verification to identify safety failures and iteratively retrains the agent to enhance safety performance, especially in critical scenarios.

Contribution

The paper presents a novel framework combining black-box verification with transfer learning to systematically improve RL-based driving safety, addressing training data limitations.

Findings

01

Efficient discovery of safety failure scenarios in RL driving agents.

02

Significant reduction in vehicle collisions through iterative retraining.

03

Enhanced generalization of RL agents in safety-critical situations.

Abstract

In this work, we propose a self-improving artificial intelligence system to enhance the safety performance of reinforcement learning (RL)-based autonomous driving (AD) agents using black-box verification methods. RL algorithms have become popular in AD applications in recent years. However, the performance of existing RL algorithms heavily depends on the diversity of training scenarios. A lack of safety-critical scenarios during the training phase could result in poor generalization performance in real-world driving applications. We propose a novel framework in which the weaknesses of the training set are explored through black-box verification methods. After discovering AD failure scenarios, the RL agent's training is re-initiated via transfer learning to improve the performance of previously unsafe scenarios. Simulation results demonstrate that our approach efficiently discovers…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAutonomous Vehicle Technology and Safety · Traffic control and management · Traffic and Road Safety

MethodsDiscriminative Adversarial Search · Self-Adversarial Negative Sampling · Self-adaptive Training · Selective Search · Monte-Carlo Tree Search · Entropy Regularization · Proximal Policy Optimization