Learning Safe Autonomous Driving Policies Using Predictive Safety Representations
Mahesh Keswani, Raunak Bhattacharyya

TL;DR
This paper evaluates the extension of the SRPL framework, which uses predictive safety representations, to real-world autonomous driving, demonstrating improved safety-performance tradeoffs and robustness across datasets.
Contribution
It shows that SRPL can enhance real-world autonomous driving safety and efficiency, with benefits influenced by the policy optimizer and dataset, and improves robustness and generalization.
Findings
SRPL improves success rate and reduces costs significantly
Predictive safety representations enhance robustness to noise
SRPL-augmented agents generalize better across datasets
Abstract
Safe reinforcement learning (SafeRL) is a prominent paradigm for autonomous driving, where agents are required to optimize performance under strict safety requirements. This dual objective creates a fundamental tension, as overly conservative policies limit driving efficiency while aggressive exploration risks safety violations. The Safety Representations for Safer Policy Learning (SRPL) framework addresses this challenge by equipping agents with a predictive model of future constraint violations and has shown promise in controlled environments. This paper investigates whether SRPL extends to real-world autonomous driving scenarios. Systematic experiments on the Waymo Open Motion Dataset (WOMD) and NuPlan demonstrate that SRPL can improve the reward-safety tradeoff, achieving statistically significant improvements in success rate (effect sizes r = 0.65-0.86) and cost reduction (effect…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAutonomous Vehicle Technology and Safety · Adversarial Robustness in Machine Learning · Reinforcement Learning in Robotics
