ClimaX: A foundation model for weather and climate
Tung Nguyen, Johannes Brandstetter, Ashish Kapoor, Jayesh K. Gupta,, Aditya Grover

TL;DR
ClimaX is a versatile deep learning foundation model for weather and climate prediction, capable of leveraging heterogeneous datasets and outperforming existing models on various benchmarks.
Contribution
We introduce ClimaX, a novel Transformer-based model that generalizes across diverse climate datasets and tasks, enabling improved weather and climate predictions.
Findings
ClimaX outperforms existing data-driven models on weather forecasting benchmarks.
Pretraining at lower resolutions still yields strong performance after fine-tuning.
ClimaX demonstrates flexibility across different variables and spatiotemporal scales.
Abstract
Most state-of-the-art approaches for weather and climate modeling are based on physics-informed numerical models of the atmosphere. These approaches aim to model the non-linear dynamics and complex interactions between multiple variables, which are challenging to approximate. Additionally, many such numerical models are computationally intensive, especially when modeling the atmospheric phenomenon at a fine-grained spatial and temporal resolution. Recent data-driven approaches based on machine learning instead aim to directly solve a downstream forecasting or projection task by learning a data-driven functional mapping using deep neural networks. However, these networks are trained using curated and homogeneous climate datasets for specific spatiotemporal tasks, and thus lack the generality of numerical models. We develop and demonstrate ClimaX, a flexible and generalizable deep…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Taxonomy
TopicsMeteorological Phenomena and Simulations · Climate variability and models · Hydrological Forecasting Using AI
MethodsMulti-Head Attention · Attention Is All You Need · Dense Connections · Adam · Position-Wise Feed-Forward Layer · Softmax · Linear Layer · Absolute Position Encodings · Dropout · Label Smoothing
