Adaptive Shielding for Safe Reinforcement Learning under Hidden-Parameter Dynamics Shifts
Minjae Kwon, Tyler Ingebrand, Ufuk Topcu, Lu Feng

TL;DR
This paper introduces Adaptive Shielding, a framework for safe reinforcement learning that adapts to unseen environment dynamics shifts using online inference and uncertainty bounds, improving safety and performance.
Contribution
It proposes a novel adaptive shielding framework with online dynamics inference and safety guarantees for safe reinforcement learning under hidden-parameter shifts.
Findings
Outperforms baselines on Safe-Gym benchmarks.
Generalizes reliably to unseen dynamics.
Maintains safety with modest computational overhead.
Abstract
Unseen shifts in environment dynamics, driven by hidden parameters such as friction or gravity, create a challenge for maintaining safety. We address this challenge by proposing Adaptive Shielding, a framework for safe reinforcement learning in constrained hidden-parameter Markov decision processes. A function encoder infers a low-dimensional representation of the underlying dynamics online from transition data, allowing the shield to adapt. To ensure safety during this process, we use a two-layer strategy. First, we introduce safety-regularized optimization that proactively trains the policy away from high-cost regions. Second, the adaptive shielding reactively uses the inferred dynamics to forecast safety risks and applies uncertainty-aware bounds using conformal prediction to filter unsafe actions. We prove that prediction errors in the shielding connect with bounds on the average…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsRadiation Effects in Electronics · Software Reliability and Analysis Research · Adversarial Robustness in Machine Learning
