From Cumulative Constraints to Adaptive Runtime Safety Control for Nonstationary Reinforcement Learning

Timofey Tomashevskiy

arXiv:2605.18841·cs.LG·May 20, 2026

From Cumulative Constraints to Adaptive Runtime Safety Control for Nonstationary Reinforcement Learning

Timofey Tomashevskiy

PDF

TL;DR

This paper introduces CPSS, a runtime safety mechanism for reinforcement learning that adaptively converts cumulative safety budgets into local, state-level safety constraints, improving safety guarantees in nonstationary environments.

Contribution

The paper proposes a novel adaptive safety shield, CPSS, that dynamically adjusts safety thresholds based on contextual signals to ensure safety in nonstationary reinforcement learning.

Findings

01

CPSS reduces safety violations in highway merging scenarios.

02

CPSS enforces safety constraints while maintaining policy performance.

03

CPSS adapts safety thresholds online based on environment demands.

Abstract

Safety in reinforcement learning is often specified through cumulative cost constraints, but these trajectory-level guarantees do not directly prevent unsafe individual decisions, especially under nonstationarity. In continual and nonstationary settings, the difficulty is amplified because the risk associated with the same action can vary across contexts, while a fixed state-level threshold may be either too conservative or too weak. We propose Constraint Projection Safety Shield (CPSS), a runtime mechanism that converts a cumulative safety budget into adaptive state-level control constraints during execution. CPSS tracks the remaining safety budget, projects it into a time-varying admissible risk threshold, and filters policy actions whose predicted safety cost exceeds the active threshold. The threshold is adjusted online using contextual signals so that enforcement becomes stricter…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.