HealDA: Highlighting the importance of initial errors in end-to-end AI weather forecasts
Aayush Gupta (1), Akshay Subramaniam (1), Michael S. Pritchard (1), Karthik Kashinath (1), Sergey Frolov (2), Kelsey Lieberman (3), Christopher Miller (3), Nicholas Silverman (3), Noah D. Brenowitz (1) ((1) NVIDIA Corporation, (2) NOAA, (3) MITRE Corporation)

TL;DR
HealDA is a novel ML-based data assimilation system that provides initial atmospheric states directly from observations, enabling ML weather forecasts with minimal loss of skill and without reliance on traditional NWP infrastructure.
Contribution
HealDA introduces a simple, direct observation-to-state ML data assimilation method that effectively initializes ML weather forecast models, reducing dependence on NWP data assimilation.
Findings
HealDA-initialized forecasts lose less than one day of effective lead time.
HealDA-initialized FCN3 ensembles trail ECMWF IFS ENS by less than 24 hours.
Forecast error growth remains unchanged from HealDA initialization, with skill primarily limited by initial analysis error.
Abstract
AI weather models now rival leading numerical weather prediction (NWP) systems in medium-range skill. However, almost all still rely on NWP data assimilation (DA) to provide initial conditions, tying them to expensive infrastructure and limiting the practical speed and accuracy gains of ML. More recently, ML-based DA systems have been proposed, which are often trained and evaluated end-to-end with a forecast model, making it difficult to assess the quality of their analysis fields. We introduce HealDA, a global ML-based DA system that maps a short window of satellite and conventional observations directly to a 1{\deg} atmospheric state on the HEALPix grid, using a smaller sensor suite than operational NWP. We treat HealDA strictly as a DA module: its analyses are used to initialize off-the-shelf ML forecast models without any fine-tuning of either. For a variety of off-the-shelf ML…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
