Automatically Learning Fallback Strategies with Model-Free Reinforcement   Learning in Safety-Critical Driving Scenarios

Ugo Lecerf; Christelle Yemdji-Tchassi; S\'ebastien Aubert; Pietro; Michiardi

arXiv:2204.05196·cs.LG·April 12, 2022

Automatically Learning Fallback Strategies with Model-Free Reinforcement Learning in Safety-Critical Driving Scenarios

Ugo Lecerf, Christelle Yemdji-Tchassi, S\'ebastien Aubert, Pietro, Michiardi

PDF

TL;DR

This paper introduces a model-free reinforcement learning approach that learns multiple fallback strategies for autonomous driving, enhancing safety by enabling the vehicle to handle unexpected environmental changes.

Contribution

It proposes a novel pseudo-reward based on trajectory distance to encourage exploration of diverse behaviors, capturing multiple fallback strategies in autonomous vehicle control.

Findings

01

Learned fallback strategies improve safety in autonomous driving scenarios.

02

Method captures multiple behavioral modes not found by standard RL.

03

Enhanced robustness to environmental uncertainties.

Abstract

When learning to behave in a stochastic environment where safety is critical, such as driving a vehicle in traffic, it is natural for human drivers to plan fallback strategies as a backup to use if ever there is an unexpected change in the environment. Knowing to expect the unexpected, and planning for such outcomes, increases our capability for being robust to unseen scenarios and may help prevent catastrophic failures. Control of Autonomous Vehicles (AVs) has a particular interest in knowing when and how to use fallback strategies in the interest of safety. Due to imperfect information available to an AV about its environment, it is important to have alternate strategies at the ready which might not have been deduced from the original training data distribution. In this paper we present a principled approach for a model-free Reinforcement Learning (RL) agent to capture multiple…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

MethodsBalanced Selection