Model-predictive control and reinforcement learning in multi-energy system case studies
Glenn Ceusters, Rom\'an Cant\'u Rodr\'iguez, Alberte Bouso Garc\'ia,, R\"udiger Franke, Geert Deconinck, Lieve Helsen, Ann Now\'e, Maarten, Messagie, Luis Ramirez Camargo

TL;DR
This paper compares model-predictive control and reinforcement learning in multi-energy systems, showing RL can outperform traditional methods with proper training, especially when models are imperfect or complex.
Contribution
It introduces a model-free reinforcement learning approach for multi-energy systems and benchmarks it against linear MPC, demonstrating RL's potential to outperform traditional control methods.
Findings
RL outperforms LMPC in simple systems (101.5% vs 98%)
RL achieves higher performance in complex systems (94.6% vs 88.9%)
RL requires extensive training but offers adaptive control without a detailed system model.
Abstract
Model-predictive-control (MPC) offers an optimal control technique to establish and ensure that the total operation cost of multi-energy systems remains at a minimum while fulfilling all system constraints. However, this method presumes an adequate model of the underlying system dynamics, which is prone to modelling errors and is not necessarily adaptive. This has an associated initial and ongoing project-specific engineering cost. In this paper, we present an on- and off-policy multi-objective reinforcement learning (RL) approach, that does not assume a model a priori, benchmarking this against a linear MPC (LMPC - to reflect current practice, though non-linear MPC performs better) - both derived from the general optimal control problem, highlighting their differences and similarities. In a simple multi-energy system (MES) configuration case study, we show that a twin delayed deep…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
