TL;DR
This study employs CNN and LSTM deep learning models to predict solar flares using data from two solar cycles, demonstrating improved accuracy with ensemble methods and interpretability via attribution techniques.
Contribution
It introduces a stacking ensemble of CNN and LSTM for flare prediction and applies Integrated Gradients for model interpretability, highlighting the importance of multi-cycle data and magnetic flux features.
Findings
LSTM trained on two cycles outperforms single-cycle training with higher TSS.
Ensemble of CNN and LSTM yields higher TSS than individual models.
Integrated Gradients attribute flare predictions to magnetic flux, revealing model limitations.
Abstract
We consider the flare prediction problem that distinguishes flare-imminent active regions that produce an M- or X-class flare in the future 24 hours, from quiet active regions that do not produce any flare within hours. Using line-of-sight magnetograms and parameters of active regions in two data products covering Solar Cycle 23 and 24, we train and evaluate two deep learning algorithms -- CNN and LSTM -- and their stacking ensembles. The decisions of CNN are explained using visual attribution methods. We have the following three main findings. (1) LSTM trained on data from two solar cycles achieves significantly higher True Skill Scores (TSS) than that trained on data from a single solar cycle with a confidence level of at least 0.95. (2) On data from Solar Cycle 23, a stacking ensemble that combines predictions from LSTM and CNN using the TSS criterion achieves significantly…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
MethodsTanh Activation · Sigmoid Activation · Long Short-Term Memory
