Efficient Learning of Voltage Control Strategies via Model-based Deep Reinforcement Learning
Ramij R. Hossain, Tianzhixi Yin, Yan Du, Renke Huang, Jie Tan, Wenhao, Yu, Yuan Liu, Qiuhua Huang

TL;DR
This paper introduces a model-based deep reinforcement learning approach using a neural network surrogate model to efficiently learn voltage control strategies in power systems, significantly reducing training time and improving safety.
Contribution
It presents a novel framework combining model-based DRL with imitation learning and reward shaping, enhancing training efficiency and stability for large-scale power system control.
Findings
Achieved 97.5% sample efficiency in training.
Attained 87.7% training efficiency on IEEE 300-bus system.
Demonstrated effective voltage control strategy learning.
Abstract
This article proposes a model-based deep reinforcement learning (DRL) method to design emergency control strategies for short-term voltage stability problems in power systems. Recent advances show promising results in model-free DRL-based methods for power systems, but model-free methods suffer from poor sample efficiency and training time, both critical for making state-of-the-art DRL algorithms practically applicable. DRL-agent learns an optimal policy via a trial-and-error method while interacting with the real-world environment. And it is desirable to minimize the direct interaction of the DRL agent with the real-world power grid due to its safety-critical nature. Additionally, state-of-the-art DRL-based policies are mostly trained using a physics-based grid simulator where dynamic simulation is computationally intensive, lowering the training efficiency. We propose a novel…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsPower System Optimization and Stability · Optimal Power Flow Distribution · Smart Grid Energy Management
MethodsTest
