Replacing Tunable Parameters in Weather and Climate Models with State-Dependent Functions using Reinforcement Learning
Pritthijit Nath, Sebastian Schemm, Henry Moss, Peter Haynes, Emily Shuckburgh, Mark J. Webb

TL;DR
This paper demonstrates that reinforcement learning can effectively learn state-dependent parametrisation schemes in idealised weather and climate models, improving accuracy and stability over static methods.
Contribution
It introduces a reinforcement learning framework for online, state-dependent parametrisation in climate models, outperforming static tuning in several idealised testbeds.
Findings
RL algorithms achieved high skill and stable convergence.
Single-agent RL outperformed static tuning in EBM.
Federated RL enabled specialized control and faster convergence.
Abstract
Weather and climate models rely on parametrisations to represent unresolved sub-grid processes. Traditional schemes rely on fixed coefficients that are weakly constrained and tuned offline, contributing to persistent biases that limit their ability to adapt to underlying physics. This study presents a framework that learns components of parametrisation schemes online as a function of the evolving model state using reinforcement learning (RL) and evaluates RL-driven parameter updates across idealised testbeds spanning a simple climate bias correction (SCBC), a radiative-convective equilibrium (RCE), and a zonal mean energy balance model (EBM) with single-agent and federated multi-agent settings. Across nine RL algorithms, Truncated Quantile Critics (TQC), Deep Deterministic Policy Gradient (DDPG), and Twin Delayed DDPG (TD3) achieved the highest skill and stable convergence, with…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
