Safe Continual Reinforcement Learning under Nonstationarity via Adaptive Safety Constraints
Timofey Tomashevskiy

TL;DR
This paper introduces LILAC+, an adaptive safety framework for continual reinforcement learning that dynamically adjusts safety constraints in response to environmental nonstationarity, improving safety and performance.
Contribution
The paper presents a novel framework combining three adaptive safety mechanisms for safe reinforcement learning in changing environments, addressing limitations of fixed constraints.
Findings
Adaptive safety constraints reduce safety violations under distribution shift.
LILAC+ maintains competitive task performance with improved safety.
Framework effectively handles both seen and unseen nonstationary conditions.
Abstract
Safe reinforcement learning in nonstationary environments requires safety mechanisms that adapt as environmental conditions change. Standard safe reinforcement learning methods often assume fixed constraints or stable environmental conditions, which can become inadequate under distribution shift. We propose LILAC+, a framework for safe continual reinforcement learning under nonstationarity that combines three adaptive safety mechanisms: context-based safety constraints, adaptation-speed constraints, and budget-to-state safety enforcement. Context-based constraints adjust safety requirements using inferred and predicted environmental context. Adaptation-speed constraints tighten safety requirements when the rate of environmental change exceeds the agent's ability to adapt safely. Budget-to-state enforcement converts cumulative safety requirements into local state-level control…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
