Quantum Reinforcement Learning-Based Two-Stage Unit Commitment Framework for Enhanced Power Systems Robustness
Xiang Wei, Ziqing Zhu, Linghua Zhu, Ze Hu, Xian Zhang, Guibin Wang,, Siqi Bu, Ka Wing Chan

TL;DR
This paper introduces a quantum reinforcement learning-based two-stage unit commitment framework that enhances power system robustness by proactively managing renewable energy uncertainties using virtual power plants.
Contribution
It presents a novel two-stage UC model integrating foresight-seeing decision-making with quantum reinforcement learning for improved efficiency and adaptability.
Findings
Outperforms existing methods in computational efficiency.
Enhances real-time responsiveness in power system scheduling.
Achieves higher solution quality with quantum algorithms.
Abstract
Unit commitment (UC) optimizes the start-up and shutdown schedules of generating units to meet load demand while minimizing costs. However, the increasing integration of renewable energy introduces uncertainties for real-time scheduling. Existing solutions face limitations both in modeling and algorithmic design. At the modeling level, they fail to incorporate widely adopted virtual power plants (VPPs) as flexibility resources, missing the opportunity to proactively mitigate potential real-time imbalances or ramping constraints through foresight-seeing decision-making. At the algorithmic level, existing probabilistic optimization, multi-stage approaches, and machine learning, face challenges in computational complexity and adaptability. To address these challenges, this study proposes a novel two-stage UC framework that incorporates foresight-seeing sequential decision-making in both…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsPower System Optimization and Stability · Smart Grid Energy Management · Optimal Power Flow Distribution
MethodsSPEED: Separable Pyramidal Pooling EncodEr-Decoder for Real-Time Monocular Depth Estimation on Low-Resource Settings
