Balancing Progress and Safety: A Novel Risk-Aware Objective for RL in Autonomous Driving

Ahmed Abouelazm; Jonas Michel; Helen Gremmelmaier; Tim Joseph; Philip Sch\"orner; and J. Marius Z\"ollner

arXiv:2505.06737·cs.RO·March 6, 2026

Balancing Progress and Safety: A Novel Risk-Aware Objective for RL in Autonomous Driving

Ahmed Abouelazm, Jonas Michel, Helen Gremmelmaier, Tim Joseph, Philip Sch\"orner, and J. Marius Z\"ollner

PDF

TL;DR

This paper introduces a risk-aware reward function for reinforcement learning in autonomous driving, improving safety and performance by addressing the limitations of traditional reward designs and incorporating a novel safety metric.

Contribution

It proposes a hierarchical, normalized reward formulation with a new risk-aware objective based on an ellipsoid function and RSS extension, enhancing safety in RL-driven autonomous driving.

Findings

01

Collision rates decreased by 21% on average.

02

The approach outperforms baseline rewards in route progress.

03

Maintains high cumulative rewards while improving safety.

Abstract

Reinforcement Learning (RL) is a promising approach for achieving autonomous driving due to robust decision-making capabilities. RL learns a driving policy through trial and error in traffic scenarios, guided by a reward function that combines the driving objectives. The design of such reward function has received insufficient attention, yielding ill-defined rewards with various pitfalls. Safety, in particular, has long been regarded only as a penalty for collisions. This leaves the risks associated with actions leading up to a collision unaddressed, limiting the applicability of RL in real-world scenarios. To address these shortcomings, our work focuses on enhancing the reward formulation by defining a set of driving objectives and structuring them hierarchically. Furthermore, we discuss the formulation of these objectives in a normalized manner to transparently determine their…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

MethodsSparse Evolutionary Training