Large Foundation Models for Power Systems
Chenghao Huang, Siyang Li, Ruohong Liu, Hao Wang, Yize Chen

TL;DR
This paper explores the application of large foundation models like GPT-4 in power systems, demonstrating their potential to improve efficiency and reliability across various tasks without task-specific training.
Contribution
It evaluates the performance of existing foundation models on key power system tasks and discusses their potential deployment in power system operations.
Findings
Foundation models perform well on power system tasks
They enhance efficiency and reliability in power system operations
Potential for future deployment in large-scale power systems
Abstract
Foundation models, such as Large Language Models (LLMs), can respond to a wide range of format-free queries without any task-specific data collection or model training, creating various research and application opportunities for the modeling and operation of large-scale power systems. In this paper, we outline how such large foundation model such as GPT-4 are developed, and discuss how they can be leveraged in challenging power and energy system tasks. We first investigate the potential of existing foundation models by validating their performance on four representative tasks across power system domains, including the optimal power flow (OPF), electric vehicle (EV) scheduling, knowledge retrieval for power engineering technical reports, and situation awareness. Our results indicate strong capabilities of such foundation models on boosting the efficiency and reliability of power system…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Natural Language Processing Techniques · Power Systems and Technologies
MethodsMulti-Head Attention · Attention Is All You Need · Linear Layer · Residual Connection · Dropout · Layer Normalization · Dense Connections · Position-Wise Feed-Forward Layer · Absolute Position Encodings · Label Smoothing
