Optimal Management of the Peak Power Penalty for Smart Grids Using MPC-based Reinforcement Learning
Wenqi Cai, Hossein N. Esfahani, Arash B. Kordabad, and S\'ebastien, Gros

TL;DR
This paper presents an MPC-based reinforcement learning approach to optimize peak power management in smart grids, reducing economic costs by coordinating multi-agent systems with renewable energy and storage.
Contribution
It introduces a novel MPC-RL method combining parametric MPC and DPG to improve peak power cost management in multi-agent smart grids.
Findings
Significant reduction in long-term economic costs.
Effective coordination among agents with renewable sources.
Improved policy accuracy with DPG adjustments.
Abstract
The cost of the power distribution infrastructures is driven by the peak power encountered in the system. Therefore, the distribution network operators consider billing consumers behind a common transformer in the function of their peak demand and leave it to the consumers to manage their collective costs. This management problem is, however, not trivial. In this paper, we consider a multi-agent residential smart grid system, where each agent has local renewable energy production and energy storage, and all agents are connected to a local transformer. The objective is to develop an optimal policy that minimizes the economic cost consisting of both the spot-market cost for each consumer and their collective peak-power cost. We propose to use a parametric Model Predictive Control (MPC)-scheme to approximate the optimal policy. The optimality of this policy is limited by its finite horizon…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
