Challenges with Extreme Class-Imbalance and Temporal Coherence: A Study on Solar Flare Data
Azim Ahmadzadeh, Maxwell Hostetter, Berkay Aydin, Manolis K., Georgoulis, Dustin J. Kempton, Sushant S. Mahajan, and Rafal A. Angryk

TL;DR
This study investigates the impacts of class imbalance and temporal coherence on solar flare prediction, highlighting pitfalls in sampling methods and proposing best practices for analyzing rare-event time series data.
Contribution
It reveals how temporal coherence affects sampling validity and performance evaluation, providing guidelines for handling class imbalance and autocorrelation in rare-event time series analysis.
Findings
Temporal coherence can artificially inflate forecast performance.
Sampling methods must account for autocorrelation to avoid misleading results.
Proper practices improve reliability in rare-event time series analysis.
Abstract
In analyses of rare-events, regardless of the domain of application, class-imbalance issue is intrinsic. Although the challenges are known to data experts, their explicit impact on the analytic and the decisions made based on the findings are often overlooked. This is in particular prevalent in interdisciplinary research where the theoretical aspects are sometimes overshadowed by the challenges of the application. To show-case these undesirable impacts, we conduct a series of experiments on a recently created benchmark data, named Space Weather ANalytics for Solar Flares (SWAN-SF). This is a multivariate time series dataset of magnetic parameters of active regions. As a remedy for the imbalance issue, we study the impact of data manipulation (undersampling and oversampling) and model manipulation (using class weights). Furthermore, we bring to focus the auto-correlation of time series…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
