Safe Autonomous Racing via Approximate Reachability on Ego-vision
Bingqing Chen, Jonathan Francis, Jean Oh, Eric Nyberg, Sylvia L., Herbert

TL;DR
This paper introduces a novel safe reinforcement learning approach for autonomous racing that integrates Hamilton-Jacobi reachability with neural approximation to ensure safety using ego-vision inputs, achieving state-of-the-art results.
Contribution
It combines Hamilton-Jacobi reachability with neural networks to enable real-time safety verification directly from high-dimensional vision data in autonomous racing.
Findings
Fewer safety constraint violations compared to baselines in Safety Gym.
Achieved state-of-the-art performance on the Learn-to-Race benchmark.
Demonstrated neural approximation of HJ safety value on high-dimensional vision inputs.
Abstract
Racing demands each vehicle to drive at its physical limits, when any safety infraction could lead to catastrophic failure. In this work, we study the problem of safe reinforcement learning (RL) for autonomous racing, using the vehicle's ego-camera view and speed as input. Given the nature of the task, autonomous agents need to be able to 1) identify and avoid unsafe scenarios under the complex vehicle dynamics, and 2) make sub-second decision in a fast-changing environment. To satisfy these criteria, we propose to incorporate Hamilton-Jacobi (HJ) reachability theory, a safety verification method for general non-linear systems, into the constrained Markov decision process (CMDP) framework. HJ reachability not only provides a control-theoretic approach to learn about safety, but also enables low-latency safety verification. Though HJ reachability is traditionally not scalable to…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdversarial Robustness in Machine Learning · Autonomous Vehicle Technology and Safety · Reinforcement Learning in Robotics
MethodsSPEED: Separable Pyramidal Pooling EncodEr-Decoder for Real-Time Monocular Depth Estimation on Low-Resource Settings
