Constrained Reinforcement Learning for Predictive Control in Real-Time Stochastic Dynamic Optimal Power Flow
Tong Wu, Anna Scaglione, Daniel Arnold

TL;DR
This paper introduces a primal-dual constrained deep reinforcement learning method for real-time dynamic optimal power flow, ensuring safety constraints are met while optimizing power system control.
Contribution
It proposes a novel primal-dual approach for constrained DRL in power systems and proves convergence of the networks, improving safety and optimality.
Findings
Outperforms existing methods in IEEE system case studies
Maintains safety constraints during dynamic control
Adapts effectively to changing system conditions
Abstract
Deep Reinforcement Learning (DRL) has become a popular method for solving control problems in power systems. Conventional DRL encourages the agent to explore various policies encoded in a neural network (NN) with the goal of maximizing the reward function. However, this approach can lead to infeasible solutions that violate physical constraints such as power flow equations, voltage limits, and dynamic constraints. Ensuring these constraints are met is crucial in power systems, as they are a safety critical infrastructure. To address this issue, existing DRL algorithms remedy the problem by projecting the actions onto the feasible set, which can result in sub-optimal solutions. This paper presents a novel primal-dual approach for learning optimal constrained DRL policies for dynamic optimal power flow problems, with the aim of controlling power generations and battery outputs. We also…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsOptimal Power Flow Distribution · Smart Grid Energy Management · Power System Optimization and Stability
