A Deep Learning Approach to Dst Index Prediction
Yasser Abduallah, Jason T. L. Wang, Prianka Bose, Genwei Zhang, Firas, Gerges, Haimin Wang

TL;DR
This paper introduces the Dst Transformer, a deep learning model that uses attention and Bayesian inference to improve short-term geomagnetic storm predictions and quantify uncertainties, outperforming existing methods.
Contribution
It presents the first application of Bayesian deep learning to Dst index forecasting, combining attention mechanisms with uncertainty quantification.
Findings
Outperforms existing machine learning methods in accuracy.
Provides both data and model uncertainty estimates.
Achieves better root mean square error and R-squared scores.
Abstract
The disturbance storm time (Dst) index is an important and useful measurement in space weather research. It has been used to characterize the size and intensity of a geomagnetic storm. A negative Dst value means that the Earth's magnetic field is weakened, which happens during storms. In this paper, we present a novel deep learning method, called the Dst Transformer, to perform short-term, 1-6 hour ahead, forecasting of the Dst index based on the solar wind parameters provided by the NASA Space Science Data Coordinated Archive. The Dst Transformer combines a multi-head attention layer with Bayesian inference, which is capable of quantifying both aleatoric uncertainty and epistemic uncertainty when making Dst predictions. Experimental results show that the proposed Dst Transformer outperforms related machine learning methods in terms of the root mean square error and R-squared.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSolar and Space Plasma Dynamics · Ionosphere and magnetosphere dynamics · Solar Radiation and Photovoltaics
MethodsDynamic Sparse Training · Attention Is All You Need · Linear Layer · Layer Normalization · Byte Pair Encoding · Absolute Position Encodings · Residual Connection · Softmax · Multi-Head Attention · Label Smoothing
