WeatherBench 2: A benchmark for the next generation of data-driven global weather models
Stephan Rasp, Stephan Hoyer, Alexander Merose, Ian Langmore, Peter, Battaglia, Tyler Russel, Alvaro Sanchez-Gonzalez, Vivian Yang, Rob Carver,, Shreya Agrawal, Matthew Chantry, Zied Ben Bouallegue, Peter Dueben, Carla, Bromberg, Jared Sisk, Luke Barrington, Aaron Bell, Fei Sha

TL;DR
WeatherBench 2 is a comprehensive, open-source benchmark designed to evaluate and accelerate the development of next-generation data-driven global weather forecasting models, incorporating new metrics, datasets, and state-of-the-art results.
Contribution
It introduces an updated evaluation framework with new metrics, datasets, and baseline models to facilitate progress in data-driven weather prediction.
Findings
Current state-of-the-art models achieve improved forecast accuracy.
The benchmark highlights key challenges and limitations in data-driven weather modeling.
Open-source tools enable reproducibility and community-driven advancements.
Abstract
WeatherBench 2 is an update to the global, medium-range (1-14 day) weather forecasting benchmark proposed by Rasp et al. (2020), designed with the aim to accelerate progress in data-driven weather modeling. WeatherBench 2 consists of an open-source evaluation framework, publicly available training, ground truth and baseline data as well as a continuously updated website with the latest metrics and state-of-the-art models: https://sites.research.google/weatherbench. This paper describes the design principles of the evaluation framework and presents results for current state-of-the-art physical and data-driven weather models. The metrics are based on established practices for evaluating weather forecasts at leading operational weather centers. We define a set of headline scores to provide an overview of model performance. In addition, we also discuss caveats in the current evaluation…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMeteorological Phenomena and Simulations · Precipitation Measurement and Analysis · Hydrological Forecasting Using AI
