# A Comparison of Flare Forecasting Methods. III. Systematic Behaviors of   Operational Solar Flare Forecasting Systems

**Authors:** K.D. Leka, Sung-Hong Park, Kanya Kusano, Jesse Andries, Graham Barnes,, Suzy Bingham, D. Shaun Bloomfield, Aoife E. McCloskey, Veronique Delouille,, David Falconer, Peter T. Gallagher, Manolis K. Georgoulis, Yuki Kubo, Kangjin, Lee, Sangwoo Lee, Vasily Lobzin, JunChul Mun, Sophie A. Murray, Tarek A.M., Hamad Nageem, Rami Qahwaji, Michael Sharpe, Rob Steenburgh, Graham Steward,, Michael Terkildsen

arXiv: 1907.02909 · 2019-09-04

## TL;DR

This paper analyzes the behavior and performance of operational solar flare forecasting systems, highlighting how different factors influence forecast accuracy and emphasizing the importance of methodological comparisons.

## Contribution

It systematically evaluates how implementation differences affect the performance of solar flare forecasting methods, building on previous comparison studies.

## Key findings

- Including persistence and human input improves forecasts.
- Restricting data to disk-center observations reduces performance.
- Combining long-term statistics with modern data sources can enhance accuracy.

## Abstract

A workshop was recently held at Nagoya University (31 October - 02 November 2017), sponsored by the Center for International Collaborative Research, at the Institute for Space-Earth Environmental Research, Nagoya University, Japan, to quantitatively compare the performance of today's operational solar flare forecasting facilities. Building upon Paper I of this series (Barnes et al. 2016), in Paper II (Leka et al. 2019) we described the participating methods for this latest comparison effort, the evaluation methodology, and presented quantitative comparisons. In this paper we focus on the behavior and performance of the methods when evaluated in the context of broad implementation differences. Acknowledging the short testing interval available and the small number of methods available, we do find that forecast performance: 1) appears to improve by including persistence or prior flare activity, region evolution, and a human "forecaster in the loop"; 2) is hurt by restricting data to disk-center observations; 3) may benefit from long-term statistics, but mostly when then combined with modern data sources and statistical approaches. These trends are arguably weak and must be viewed with numerous caveats, as discussed both here and in Paper II. Following this present work, we present in Paper IV a novel analysis method to evaluate temporal patterns of forecasting errors of both types (i.e., misses and false alarms; Park et al. 2019). Hence, most importantly, with this series of papers we demonstrate the techniques for facilitating comparisons in the interest of establishing performance-positive methodologies.

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/1907.02909/full.md

## Figures

25 figures with captions in the complete paper: https://tomesphere.com/paper/1907.02909/full.md

## References

40 references — full list in the complete paper: https://tomesphere.com/paper/1907.02909/full.md

---
Source: https://tomesphere.com/paper/1907.02909