Toward Reliable Benchmarking of Solar Flare Forecasting Methods
D. Shaun Bloomfield, Paul A. Higgins, R. T. James McAteer, Peter T., Gallagher

TL;DR
This paper evaluates the reliability of solar flare forecasting methods by comparing Poisson probability models with other systems, emphasizing the importance of proper verification metrics like TSS for assessing forecast accuracy.
Contribution
It introduces a standardized approach using Poisson probabilities and advocates for the true skill statistic (TSS) as a benchmark for solar flare forecast evaluation.
Findings
Poisson probabilities perform comparably to complex models.
Maximum TSS values vary across flare classes, indicating forecast difficulty.
Using average flare rates has limitations in representing flaring probability distributions.
Abstract
Solar flares occur in complex sunspot groups, but it remains unclear how the probability of producing a flare of a given magnitude relates to the characteristics of the sunspot group. Here, we use Geostationary Operational Environmental Satellite X-ray flares and McIntosh group classifications from solar cycles 21 and 22 to calculate average flare rates for each McIntosh class and use these to determine Poisson probabilities for different flare magnitudes. Forecast verification measures are studied to find optimum thresholds to convert Poisson flare probabilities into yes/no predictions of cycle 23 flares. A case is presented to adopt the true skill statistic (TSS) as a standard for forecast comparison over the commonly used Heidke skill score (HSS). In predicting flares over 24 hr, the maximum values of TSS achieved are 0.44 (C-class), 0.53 (M-class), 0.74 (X-class), 0.54 (>=M1.0), and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
