ClimART: A Benchmark Dataset for Emulating Atmospheric Radiative Transfer in Weather and Climate Models
Salva R\"uhling Cachay, Venkatesh Ramesh, Jason N. S. Cole, Howard, Barker, David Rolnick

TL;DR
ClimART is a comprehensive dataset designed to benchmark machine learning models for emulating atmospheric radiative transfer in climate simulations, addressing the need for standardized evaluation and improved model performance.
Contribution
This work introduces ClimART, a large-scale dataset with over 10 million samples across various climate conditions, and provides baseline models and best practices for ML benchmarking in atmospheric physics.
Findings
Existing models show limitations in accuracy and speed.
The dataset reveals challenges in out-of-distribution generalization.
Baseline results highlight areas for future improvement.
Abstract
Numerical simulations of Earth's weather and climate require substantial amounts of computation. This has led to a growing interest in replacing subroutines that explicitly compute physical processes with approximate machine learning (ML) methods that are fast at inference time. Within weather and climate models, atmospheric radiative transfer (RT) calculations are especially expensive. This has made them a popular target for neural network-based emulators. However, prior work is hard to compare due to the lack of a comprehensive dataset and standardized best practices for ML benchmarking. To fill this gap, we build a large dataset, ClimART, with more than \emph{10 million samples from present, pre-industrial, and future climate conditions}, based on the Canadian Earth System Model. ClimART poses several methodological challenges for the ML community, such as multiple…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMeteorological Phenomena and Simulations · Solar Radiation and Photovoltaics · Climate variability and models
MethodsLearnable adjacency matrix GCN · 1-Dimensional Convolutional Neural Networks · Graph Convolutional Network
