Safe reinforcement learning for multi-energy management systems with known constraint functions
Glenn Ceusters, Luis Ramirez Camargo, R\"udiger Franke, Ann Now\'e,, Maarten Messagie

TL;DR
This paper introduces two novel safe reinforcement learning methods, SafeFallback and GiveSafe, which ensure constraint satisfaction during training and deployment in multi-energy systems without complex optimization, improving safety and efficiency.
Contribution
The paper presents two new safe RL algorithms that decouple safety constraints from the RL formulation, providing hard guarantees without solving additional optimization problems.
Findings
SafeFallback outperforms vanilla RL in utility (102.9% vs 100%)
Both methods achieve higher utility than benchmarks (94.6% and 82.8%)
Methods provide safety guarantees even with random policies.
Abstract
Reinforcement learning (RL) is a promising optimal control technique for multi-energy management systems. It does not require a model a priori - reducing the upfront and ongoing project-specific engineering effort and is capable of learning better representations of the underlying system dynamics. However, vanilla RL does not provide constraint satisfaction guarantees - resulting in various potentially unsafe interactions within its safety-critical environment. In this paper, we present two novel safe RL methods, namely SafeFallback and GiveSafe, where the safety constraint formulation is decoupled from the RL formulation. These provide hard-constraint, rather than soft- and chance-constraint, satisfaction guarantees both during training a (near) optimal policy (which involves exploratory and exploitative, i.e. greedy, steps) as well as during deployment of any policy (e.g. random…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsFuel Cells and Related Materials · Energy Efficiency and Management · Reinforcement Learning in Robotics
