Safe reinforcement learning for multi-energy management systems with   known constraint functions

Glenn Ceusters; Luis Ramirez Camargo; R\"udiger Franke; Ann Now\'e,; Maarten Messagie

arXiv:2207.03830·eess.SY·September 2, 2022·1 cites

Safe reinforcement learning for multi-energy management systems with known constraint functions

Glenn Ceusters, Luis Ramirez Camargo, R\"udiger Franke, Ann Now\'e,, Maarten Messagie

PDF

Open Access

TL;DR

This paper introduces two novel safe reinforcement learning methods, SafeFallback and GiveSafe, which ensure constraint satisfaction during training and deployment in multi-energy systems without complex optimization, improving safety and efficiency.

Contribution

The paper presents two new safe RL algorithms that decouple safety constraints from the RL formulation, providing hard guarantees without solving additional optimization problems.

Findings

01

SafeFallback outperforms vanilla RL in utility (102.9% vs 100%)

02

Both methods achieve higher utility than benchmarks (94.6% and 82.8%)

03

Methods provide safety guarantees even with random policies.

Abstract

Reinforcement learning (RL) is a promising optimal control technique for multi-energy management systems. It does not require a model a priori - reducing the upfront and ongoing project-specific engineering effort and is capable of learning better representations of the underlying system dynamics. However, vanilla RL does not provide constraint satisfaction guarantees - resulting in various potentially unsafe interactions within its safety-critical environment. In this paper, we present two novel safe RL methods, namely SafeFallback and GiveSafe, where the safety constraint formulation is decoupled from the RL formulation. These provide hard-constraint, rather than soft- and chance-constraint, satisfaction guarantees both during training a (near) optimal policy (which involves exploratory and exploitative, i.e. greedy, steps) as well as during deployment of any policy (e.g. random…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsFuel Cells and Related Materials · Energy Efficiency and Management · Reinforcement Learning in Robotics