DeFIX: Detecting and Fixing Failure Scenarios with Reinforcement Learning in Imitation Learning Based Autonomous Driving

Resul Dagdanov; Feyza Eksen; Halil Durmus; Ferhat Yurdakul; Nazim Kemal Ure

arXiv:2210.16567·cs.RO·July 21, 2025

DeFIX: Detecting and Fixing Failure Scenarios with Reinforcement Learning in Imitation Learning Based Autonomous Driving

Resul Dagdanov, Feyza Eksen, Halil Durmus, Ferhat Yurdakul, Nazim Kemal Ure

PDF

2 Repos

TL;DR

DeFIX is a continuous learning framework that detects failure scenarios in imitation learning-based autonomous driving and trains reinforcement learning agents to fix these failures, improving safety and performance in urban environments.

Contribution

This paper introduces a novel iterative framework combining failure detection and reinforcement learning to enhance imitation learning in autonomous driving.

Findings

01

DeFIX outperforms state-of-the-art IL and RL benchmarks.

02

The framework effectively identifies failure scenarios in complex urban environments.

03

A single RL agent trained on failure scenarios can significantly improve driving safety.

Abstract

Safely navigating through an urban environment without violating any traffic rules is a crucial performance target for reliable autonomous driving. In this paper, we present a Reinforcement Learning (RL) based methodology to DEtect and FIX (DeFIX) failures of an Imitation Learning (IL) agent by extracting infraction spots and re-constructing mini-scenarios on these infraction areas to train an RL agent for fixing the shortcomings of the IL approach. DeFIX is a continuous learning framework, where extraction of failure scenarios and training of RL agents are executed in an infinite loop. After each new policy is trained and added to the library of policies, a policy classifier method effectively decides on which policy to activate at each step during the evaluation. It is demonstrated that even with only one RL agent trained on failure scenario of an IL agent, DeFIX method is either…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

MethodsLib · Entropy Regularization · Proximal Policy Optimization · *Communicated@Fast*How Do I Communicate to Expedia? · Residual Connection · 1x1 Convolution · Batch Normalization · Average Pooling · Dense Connections · Q-Learning