Exploring the design space of deep-learning-based weather forecasting systems
Shoaib Ahmed Siddiqui, Jean Kossaifi, Boris Bonev, Christopher Choy,, Jan Kautz, David Krueger, Kamyar Azizzadenesheli

TL;DR
This paper systematically analyzes various design choices in deep-learning-based weather forecasting systems, comparing architectures, training schemes, and datasets to identify best practices and propose a hybrid model for improved performance.
Contribution
It provides a comprehensive analysis of design options and introduces a hybrid system combining fixed-grid and grid-invariant architectures for better weather forecasting.
Findings
Fixed-grid architectures outperform grid-invariant models.
Multi-step fine-tuning is crucial for model performance.
Larger datasets benefit smaller models significantly.
Abstract
Despite tremendous progress in developing deep-learning-based weather forecasting systems, their design space, including the impact of different design choices, is yet to be well understood. This paper aims to fill this knowledge gap by systematically analyzing these choices including architecture, problem formulation, pretraining scheme, use of image-based pretrained models, loss functions, noise injection, multi-step inputs, additional static masks, multi-step finetuning (including larger stride models), as well as training on a larger dataset. We study fixed-grid architectures such as UNet, fully convolutional architectures, and transformer-based models, along with grid-invariant architectures, including graph-based and operator-based models. Our results show that fixed-grid architectures outperform grid-invariant architectures, indicating a need for further architectural…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsData Visualization and Analytics · Traffic Prediction and Management Techniques · Educational and Technological Research
