LLM-Guided Safe Reinforcement Learning for Energy System Topology Reconfiguration
Zongyan Zhang, Chao Shen, Xu Wan, Jie Song, Mingyang Sun

TL;DR
This paper introduces a novel safe reinforcement learning framework that combines Large Language Models with a Safety Soft Actor-Critic algorithm to improve the safety, efficiency, and scalability of power grid topology reconfiguration.
Contribution
It proposes an integrated LLM-guided safe RL approach with domain knowledge to enhance decision-making in complex energy system topology control.
Findings
Improved reward and safety metrics on IEEE benchmarks.
Longer operational survival times compared to baseline methods.
Effective risk-aware policy optimization with LLM guidance.
Abstract
The increasing penetration of renewable generation and the growing variability of electrified demand introduce substantial operational uncertainty to modern power systems. Topology reconfiguration is widely recognized as an effective and economical means to enhance grid resilience. Due to the coexistence of AC power-flow constraints and discrete switching decisions, topology reconfiguration in large-scale systems leads to a highly nonlinear and nonconvex optimization problem, making traditional methods computationally prohibitive. Consequently, several studies have explored reinforcement learning-based approaches to improve scalability and operational efficiency. However, its practical implementation is challenged by the high-dimensional combinatorial action space and the need to ensure safety during learning-based decision-making. To address these challenges, this paper presents a safe…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsOptimal Power Flow Distribution · Power System Optimization and Stability · Smart Grid Security and Resilience
