A Reinforcement Learning Approach to Parameter Selection for Distributed   Optimal Power Flow

Sihan Zeng; Alyssa Kody; Youngdae Kim; Kibaek Kim; Daniel K. Molzahn

arXiv:2110.11991·eess.SY·May 9, 2022

A Reinforcement Learning Approach to Parameter Selection for Distributed Optimal Power Flow

Sihan Zeng, Alyssa Kody, Youngdae Kim, Kibaek Kim, Daniel K. Molzahn

PDF

Open Access

TL;DR

This paper introduces a reinforcement learning method to adaptively select penalty parameters in distributed optimal power flow problems, significantly improving convergence speed and robustness over heuristic methods.

Contribution

It develops a deep Q-learning based policy for penalty parameter selection in ADMM for ACOPF, demonstrating substantial acceleration and generalizability.

Findings

01

Up to 59% reduction in iteration count compared to heuristic methods.

02

Policy generalizes well to unseen loading and system loss scenarios.

03

Provides proof-of-concept for RL in parameter tuning for power system optimization.

Abstract

With the increasing penetration of distributed energy resources, distributed optimization algorithms have attracted significant attention for power systems applications due to their potential for superior scalability, privacy, and robustness to a single point-of-failure. The Alternating Direction Method of Multipliers (ADMM) is a popular distributed optimization algorithm; however, its convergence performance is highly dependent on the selection of penalty parameters, which are usually chosen heuristically. In this work, we use reinforcement learning (RL) to develop an adaptive penalty parameter selection policy for the AC optimal power flow (ACOPF) problem solved via ADMM with the goal of minimizing the number of iterations until convergence. We train our RL policy using deep Q-learning, and show that this policy can result in significantly accelerated convergence (up to a 59%…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsOptimal Power Flow Distribution · Power System Optimization and Stability · Microgrid Control and Optimization

MethodsAlternating Direction Method of Multipliers