Robust Defense Against Extreme Grid Events Using Dual-Policy Reinforcement Learning Agents
Benjamin M. Peter, Mert Korkali

TL;DR
This paper demonstrates how dual-policy reinforcement learning agents, using PPO and GNNs within the Grid2Op platform, can effectively manage and defend power grids against extreme events and cyber threats, improving grid resilience.
Contribution
It introduces a novel dual-policy RL framework with opponent modeling for power grid defense, enhancing security assessment and contingency screening capabilities.
Findings
RL agents successfully avoid grid failure during simulated events
Opponent modeling improves the robustness of grid management strategies
The approach offers a new method for N-k contingency screening
Abstract
Reinforcement learning (RL) agents are powerful tools for managing power grids. They use large amounts of data to inform their actions and receive rewards or penalties as feedback to learn favorable responses for the system. Once trained, these agents can efficiently make decisions that would be too computationally complex for a human operator. This ability is especially valuable in decarbonizing power networks, where the demand for RL agents is increasing. These agents are well suited to control grid actions since the action space is constantly growing due to uncertainties in renewable generation, microgrid integration, and cybersecurity threats. To assess the efficacy of RL agents in response to an adverse grid event, we use the Grid2Op platform for agent training. We employ a proximal policy optimization (PPO) algorithm in conjunction with graph neural networks (GNNs). By simulating…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSmart Grid Security and Resilience
