Replacing Tunable Parameters in Weather and Climate Models with State-Dependent Functions using Reinforcement Learning

Pritthijit Nath; Sebastian Schemm; Henry Moss; Peter Haynes; Emily Shuckburgh; Mark J. Webb

arXiv:2601.04268·cs.LG·April 9, 2026

Replacing Tunable Parameters in Weather and Climate Models with State-Dependent Functions using Reinforcement Learning

Pritthijit Nath, Sebastian Schemm, Henry Moss, Peter Haynes, Emily Shuckburgh, Mark J. Webb

PDF

TL;DR

This paper demonstrates that reinforcement learning can effectively learn state-dependent parametrisation schemes in idealised weather and climate models, improving accuracy and stability over static methods.

Contribution

It introduces a reinforcement learning framework for online, state-dependent parametrisation in climate models, outperforming static tuning in several idealised testbeds.

Findings

01

RL algorithms achieved high skill and stable convergence.

02

Single-agent RL outperformed static tuning in EBM.

03

Federated RL enabled specialized control and faster convergence.

Abstract

Weather and climate models rely on parametrisations to represent unresolved sub-grid processes. Traditional schemes rely on fixed coefficients that are weakly constrained and tuned offline, contributing to persistent biases that limit their ability to adapt to underlying physics. This study presents a framework that learns components of parametrisation schemes online as a function of the evolving model state using reinforcement learning (RL) and evaluates RL-driven parameter updates across idealised testbeds spanning a simple climate bias correction (SCBC), a radiative-convective equilibrium (RCE), and a zonal mean energy balance model (EBM) with single-agent and federated multi-agent settings. Across nine RL algorithms, Truncated Quantile Critics (TQC), Deep Deterministic Policy Gradient (DDPG), and Twin Delayed DDPG (TD3) achieved the highest skill and stable convergence, with…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.