Safe Autonomous Racing via Approximate Reachability on Ego-vision

Bingqing Chen; Jonathan Francis; Jean Oh; Eric Nyberg; Sylvia L.; Herbert

arXiv:2110.07699·cs.RO·December 2, 2021·5 cites

Safe Autonomous Racing via Approximate Reachability on Ego-vision

Bingqing Chen, Jonathan Francis, Jean Oh, Eric Nyberg, Sylvia L., Herbert

PDF

Open Access

TL;DR

This paper introduces a novel safe reinforcement learning approach for autonomous racing that integrates Hamilton-Jacobi reachability with neural approximation to ensure safety using ego-vision inputs, achieving state-of-the-art results.

Contribution

It combines Hamilton-Jacobi reachability with neural networks to enable real-time safety verification directly from high-dimensional vision data in autonomous racing.

Findings

01

Fewer safety constraint violations compared to baselines in Safety Gym.

02

Achieved state-of-the-art performance on the Learn-to-Race benchmark.

03

Demonstrated neural approximation of HJ safety value on high-dimensional vision inputs.

Abstract

Racing demands each vehicle to drive at its physical limits, when any safety infraction could lead to catastrophic failure. In this work, we study the problem of safe reinforcement learning (RL) for autonomous racing, using the vehicle's ego-camera view and speed as input. Given the nature of the task, autonomous agents need to be able to 1) identify and avoid unsafe scenarios under the complex vehicle dynamics, and 2) make sub-second decision in a fast-changing environment. To satisfy these criteria, we propose to incorporate Hamilton-Jacobi (HJ) reachability theory, a safety verification method for general non-linear systems, into the constrained Markov decision process (CMDP) framework. HJ reachability not only provides a control-theoretic approach to learn about safety, but also enables low-latency safety verification. Though HJ reachability is traditionally not scalable to…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdversarial Robustness in Machine Learning · Autonomous Vehicle Technology and Safety · Reinforcement Learning in Robotics

MethodsSPEED: Separable Pyramidal Pooling EncodEr-Decoder for Real-Time Monocular Depth Estimation on Low-Resource Settings