Model-Free DRL Control for Power Inverters: From Policy Learning to Real-Time Implementation via Knowledge Distillation

Yang Yang; Chenggang Cui; Xitong Niu; Jiaming Liu; and Chuanlin Zhang

arXiv:2603.07964·eess.SY·March 10, 2026

Model-Free DRL Control for Power Inverters: From Policy Learning to Real-Time Implementation via Knowledge Distillation

Yang Yang, Chenggang Cui, Xitong Niu, Jiaming Liu, and Chuanlin Zhang

PDF

Open Access

TL;DR

This paper introduces a model-free DRL control framework for power inverters that uses policy distillation with an error energy-guided reward to improve transient response and reduce computational load, enabling real-time deployment.

Contribution

It proposes a novel hybrid reward mechanism and importance weighting in policy distillation to enhance control performance and computational efficiency in power inverter applications.

Findings

01

Reduces inference time to microseconds

02

Achieves superior transient response speed

03

Improves parameter robustness

Abstract

In response to the trade-off between control performance and computational burden hindering the deployment of Deep Reinforcement Learning (DRL) in power inverters, this paper presents a novel model-free control framework leveraging policy distillation. To handle the convergence instability and steady-state errors inherent in model-free agents, an error energy-guided hybrid reward mechanism is established to theoretically constrain the exploration space. More specifically, an adaptive importance weighting mechanism is integrated into the distillation architecture to amplify the significance of fluctuation regions, ensuring high-quality transfer of transient control logic by mitigating the observational bias dominated by steady-state data. This approach efficiently compresses the heavy DRL policy into a lightweight neural network, retaining the desired control performance while overcoming…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMicrogrid Control and Optimization · Sensorless Control of Electric Motors · Wind Turbine Control Systems