GenCos' Behaviors Modeling Based on Q Learning Improved by Dichotomy

Qiangang Jia; Zhaoyu Hu; Yiyan Li; Zheng Yan; Sijie Chen

arXiv:2008.01536·eess.SY·August 5, 2020

GenCos' Behaviors Modeling Based on Q Learning Improved by Dichotomy

Qiangang Jia, Zhaoyu Hu, Yiyan Li, Zheng Yan, Sijie Chen

PDF

Open Access

TL;DR

This paper introduces an improved Q learning algorithm using dichotomy to model GenCos' behaviors more efficiently in electricity markets, reducing convergence time through state and action space dichotomization.

Contribution

The paper proposes a novel Q learning method enhanced by dichotomy, significantly improving convergence efficiency in modeling GenCos' behaviors.

Findings

01

Effective in reducing convergence time

02

Demonstrated success in repeated Cournot game simulations

03

Improves practical applicability of Q learning in market modeling

Abstract

Q learning is widely used to simulate the behaviors of generation companies (GenCos) in an electricity market. However, existing Q learning method usually requires numerous iterations to converge, which is time-consuming and inefficient in practice. To enhance the calculation efficiency, a novel Q learning algorithm improved by dichotomy is proposed in this paper. This method modifies the update process of the Q table by dichotomizing the state space and the action space step by step. Simulation results in a repeated Cournot game show the effectiveness of the proposed algorithm.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsElectric Power System Optimization · Energy Load and Power Forecasting · Auction Theory and Applications