Joint Energy Dispatch and Unit Commitment in Microgrids Based on Deep Reinforcement Learning
Jiaju Qi, Lei Lei, Kan Zheng, Simon X. Yang

TL;DR
This paper introduces a novel deep reinforcement learning algorithm, HAFH-DDPG, for joint energy dispatch and unit commitment in microgrids, effectively reducing costs while managing hybrid action spaces.
Contribution
It proposes a hybrid DRL algorithm combining DQN and DDPG within a finite-horizon framework for MG energy management, addressing hybrid action space challenges.
Findings
HAFH-DDPG outperforms baseline algorithms in cost reduction.
The diesel generator selection strategy simplifies the action space.
Experimental results validate the algorithm's effectiveness with real-world data.
Abstract
Nowadays, the application of microgrids (MG) with renewable energy is becoming more and more extensive, which creates a strong need for dynamic energy management. In this paper, deep reinforcement learning (DRL) is applied to learn an optimal policy for making joint energy dispatch (ED) and unit commitment (UC) decisions in an isolated MG, with the aim for reducing the total power generation cost on the premise of ensuring the supply-demand balance. In order to overcome the challenge of discrete-continuous hybrid action space due to joint ED and UC, we propose a DRL algorithm, i.e., the hybrid action finite-horizon DDPG (HAFH-DDPG), that seamlessly integrates two classical DRL algorithms, i.e., deep Q-network (DQN) and deep deterministic policy gradient (DDPG), based on a finite-horizon dynamic programming (DP) framework. Moreover, a diesel generator (DG) selection strategy is presented…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMicrogrid Control and Optimization · Smart Grid Energy Management · Optimal Power Flow Distribution
Methods*Communicated@Fast*How Do I Communicate to Expedia? · Weight Decay · Adam · Experience Replay · Convolution · Dense Connections · Batch Normalization · Deep Deterministic Policy Gradient
