CaFA: Global Weather Forecasting with Factorized Attention on Sphere
Zijie Li, Anthony Zhou, Saurabh Patil, Amir Barati Farimani

TL;DR
This paper introduces CaFA, a factorized attention model for global weather forecasting on spherical surfaces, achieving state-of-the-art accuracy with reduced computational complexity compared to traditional Transformer models.
Contribution
The paper presents a novel factorized-attention mechanism tailored for spherical geometries, significantly reducing computational costs while maintaining high forecasting accuracy.
Findings
Achieves accuracy comparable to state-of-the-art models.
Reduces computational complexity from quadratic to axial resolution.
Enhances the accuracy-efficiency trade-off in Transformer-based weather prediction.
Abstract
Accurate weather forecasting is crucial in various sectors, impacting decision-making processes and societal events. Data-driven approaches based on machine learning models have recently emerged as a promising alternative to numerical weather prediction models given their potential to capture physics of different scales from historical data and the significantly lower computational cost during the prediction stage. Renowned for its state-of-the-art performance across diverse domains, the Transformer model has also gained popularity in machine learning weather prediction. Yet applying Transformer architectures to weather forecasting, particularly on a global scale is computationally challenging due to the quadratic complexity of attention and the quadratic increase in spatial points as resolution increases. In this work, we propose a factorized-attention-based model tailored for…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsComputational Physics and Python Applications · Advanced Computational Techniques and Applications · Hydrological Forecasting Using AI
MethodsAttention Is All You Need · Linear Layer · Multi-Head Attention · Dense Connections · Position-Wise Feed-Forward Layer · Dropout · Label Smoothing · Residual Connection · Absolute Position Encodings · Byte Pair Encoding
