Safe Value Functions

Pierre-Fran\c{c}ois Massiani; Steve Heim; Friedrich Solowjow,; Sebastian Trimpe

arXiv:2105.12204·eess.SY·June 10, 2024

Safe Value Functions

Pierre-Fran\c{c}ois Massiani, Steve Heim, Friedrich Solowjow,, Sebastian Trimpe

PDF

1 Repo

TL;DR

This paper formalizes the concept of safe value functions in reinforcement learning, establishing conditions under which safety constraints can be incorporated into optimal value functions through penalties, and providing insights for designing safe reward functions.

Contribution

It introduces the formal notion of safe value functions, analyzes the relationship between penalties, rewards, and safety, and offers practical heuristics for reward design in safety-critical control tasks.

Findings

01

Existence of finite penalties that induce safe value functions.

02

Larger penalties do not compromise optimality.

03

Clear structure of interactions between penalties, rewards, and dynamics.

Abstract

Safety constraints and optimality are important, but sometimes conflicting criteria for controllers. Although these criteria are often solved separately with different tools to maintain formal guarantees, it is also common practice in reinforcement learning to simply modify reward functions by penalizing failures, with the penalty treated as a mere heuristic. We rigorously examine the relationship of both safety and optimality to penalties, and formalize sufficient conditions for safe value functions (SVFs): value functions that are both optimal for a given task, and enforce safety constraints. We reveal this structure by examining when rewards preserve viability under optimal control, and show that there always exists a finite penalty that induces a safe value function. This penalty is not unique, but upper-unbounded: larger penalties do not harm optimality. Although it is often not…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

sheim/vibly
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.