Navigating the Noise: Bringing Clarity to ML Parameterization Design with O(100) Ensembles
Jerry Lin, Sungduk Yu, Liran Peng, Tom Beucler, Eliot Wong-Toi, Zeyuan, Hu, Pierre Gentine, Margarita Geleta, Mike Pritchard

TL;DR
This paper introduces a scalable ensemble-based pipeline to systematically evaluate ML parameterizations for climate models, revealing how offline design choices impact online stability and performance.
Contribution
It presents an end-to-end pipeline for large ensemble simulations, enabling empirical assessment of offline-online performance relationships in ML climate parameterizations.
Findings
Removing dropout and changing loss functions have opposite effects on online stability.
Certain design choices like memory and humidity conversion do not compromise online performance.
Ensemble sizes of around 100 are needed to reliably detect causally relevant online differences.
Abstract
Machine-learning (ML) parameterizations of subgrid processes (here of turbulence, convection, and radiation) may one day replace conventional parameterizations by emulating high-resolution physics without the cost of explicit simulation. However, uncertainty about the relationship between offline and online performance (i.e., when integrated with a large-scale general circulation model (GCM)) hinders their development. Much of this uncertainty stems from limited sampling of the noisy, emergent effects of upstream ML design decisions on downstream online hybrid simulation. Our work rectifies the sampling issue via the construction of a semi-automated, end-to-end pipeline for size ensembles of hybrid simulations, revealing important nuances in how systematic reductions in offline error manifest in changes to online error and online stability. For example, removing…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsClimate variability and models · Meteorological Phenomena and Simulations · Hydrology and Watershed Management Studies
