Heterogeneous Multi-Agent Proximal Policy Optimization for Power Distribution System Restoration
Parya Dolatyabi, Ali Farajzadeh Bavil, Mahdi Khodayar

TL;DR
This paper introduces a heterogeneous multi-agent reinforcement learning framework using HAPPO to efficiently restore power distribution systems after outages, handling complex nonlinear constraints and microgrid coordination.
Contribution
It develops a novel Heterogeneous-Agent Proximal Policy Optimization method for scalable, stable, and feasible power system restoration across interconnected microgrids.
Findings
HAPPO outperforms PPO, QMIX, and Mean-Field RL in restoration tasks.
Achieves over 95% load restoration with low latency.
Demonstrates stability and reproducibility across multiple runs.
Abstract
Restoring power distribution systems (PDSs) after large-scale outages requires sequential switching actions that reconfigure feeder topology and coordinate distributed energy resources (DERs) under nonlinear constraints, including power balance, voltage limits, and thermal ratings. These challenges limit the scalability of conventional optimization and value-based reinforcement learning (RL) approaches. This paper applies a Heterogeneous-Agent Reinforcement Learning (HARL) framework via Heterogeneous-Agent Proximal Policy Optimization (HAPPO) to enable coordinated restoration across interconnected microgrids. Each agent controls a distinct microgrid with different loads, DER capacities, and switch counts. Decentralized actors are trained with a centralized critic for stable on-policy learning, while a physics-informed OpenDSS environment enforces electrical feasibility. Experiments on…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsOptimal Power Flow Distribution · Power System Optimization and Stability · Electric Power System Optimization
