Fine-Tuning LLMs to Generate Economical and Reliable Actions for the Power Grid
Mohamad Chehade, Hao Zhu

TL;DR
This paper introduces a multi-stage fine-tuning approach for large language models to generate reliable and economical corrective actions for power grid safety shutoffs, improving feasibility and voltage stability.
Contribution
It presents a novel pipeline combining supervised fine-tuning, preference optimization, and best-of-N selection to enhance LLM performance in power grid scenarios.
Findings
Significant reduction in power-flow failures from 50% to single digits.
Improved DC objective values over zero-shot methods.
Enhanced voltage-penalty outcomes on IEEE 118-bus scenarios.
Abstract
Public Safety Power Shutoffs (PSPS) force rapid topology changes that can render standard operating points infeasible, requiring operators to quickly identify corrective transmission switching actions that reduce load shedding while maintaining acceptable voltage behavior. We present a verifiable, multi-stage adaptation pipeline that fine-tunes an instruction-tuned large language model (LLM) to generate \emph{open-only} corrective switching plans from compact PSPS scenario summaries under an explicit switching budget. First, supervised fine-tuning distills a DC-OPF MILP oracle into a constrained action grammar that enables reliable parsing and feasibility checks. Second, direct preference optimization refines the policy using AC-evaluated preference pairs ranked by a voltage-penalty metric, injecting voltage-awareness beyond DC imitation. Finally, best-of- selection provides an…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsOptimal Power Flow Distribution · Power System Optimization and Stability · Formal Methods in Verification
