Accelerating Reinforcement Learning for Wind Farm Control via Expert Demonstrations

Marcus Binder Nilsen; Julian Quick; Tuhfe G\"o\c{c}men; Nikolay Dimitrov; Pierre-Elouan R\'ethor\'e

arXiv:2604.22794·eess.SY·April 28, 2026

Accelerating Reinforcement Learning for Wind Farm Control via Expert Demonstrations

Marcus Binder Nilsen, Julian Quick, Tuhfe G\"o\c{c}men, Nikolay Dimitrov, Pierre-Elouan R\'ethor\'e

PDF

TL;DR

This paper demonstrates that expert demonstrations from steady-state wake models can significantly accelerate reinforcement learning for wind farm control, reducing training time and improving initial performance.

Contribution

The study introduces a pretraining method using expert demonstrations to initialize RL agents, enhancing early performance and convergence speed in wind farm control tasks.

Findings

01

Pretraining eliminates the costly initial learning phase.

02

Pretrained agents start near baseline performance, outperforming untrained agents.

03

All agents eventually surpass lookup-table controllers after fine-tuning.

Abstract

Reinforcement learning (RL) offers a promising approach for adaptive wind farm flow control, yet its practical deployment is hindered by slow training convergence and poor initial performance, factors that could translate to years of reduced power output if an untrained agent were deployed directly. This work investigates whether domain knowledge from steady-state wake models can accelerate RL training and improve initial controller performance. We propose a pretraining methodology in which expert demonstrations are generated by deploying a PyWake-based steady-state optimizer within a dynamic wake simulation (WindGym), then used to initialize both the actor and critic networks of a Soft Actor-Critic agent via behavior cloning. Experiments on a 2x2 wind farm show that pretraining eliminates the costly initial learning phase: while an untrained agent underperforms the greedy zero-yaw…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.