Control Policy Correction Framework for Reinforcement Learning-based Energy Arbitrage Strategies
Seyed Soroush Karimi Madahi, Gargya Gokhale, Marie-Sophie Verwee, Bert, Claessens, Chris Develder

TL;DR
This paper introduces a safe reinforcement learning control framework for energy arbitrage in battery systems, combining revenue optimization with post-processing corrections to ensure safety, validated on real-world data and hardware.
Contribution
It presents a novel RL-based control framework with a post-processing correction step to guarantee safety in energy arbitrage applications, adaptable beyond this domain.
Findings
Effective in maximizing arbitrage revenue
Ensures safety through policy correction
Validated on real battery and Belgian market data
Abstract
A continuous rise in the penetration of renewable energy sources, along with the use of the single imbalance pricing, provides a new opportunity for balance responsible parties to reduce their cost through energy arbitrage in the imbalance settlement mechanism. Model-free reinforcement learning (RL) methods are an appropriate choice for solving the energy arbitrage problem due to their outstanding performance in solving complex stochastic sequential problems. However, RL is rarely deployed in real-world applications since its learned policy does not necessarily guarantee safety during the execution phase. In this paper, we propose a new RL-based control framework for batteries to obtain a safe energy arbitrage strategy in the imbalance settlement mechanism. In our proposed control framework, the agent initially aims to optimize the arbitrage revenue. Subsequently, in the post-processing…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSmart Grid Energy Management · Transportation and Mobility Innovations
MethodsKnowledge Distillation
