Efficient labeling of solar flux evolution videos by a deep learning model
Subhamoy Chatterjee, Andr\'es Mu\~noz-Jaramillo, and Derek A. Lamb

TL;DR
This paper presents a deep learning approach that significantly reduces manual effort in labeling solar flux videos by iteratively refining CNN-based annotations and locating magnetic flux emergence events without retraining.
Contribution
The study introduces an iterative CNN-based labeling method that improves data quality and reduces manual verification by 50%, and enables detection of emergence times without retraining.
Findings
CNN reduces manual labeling effort by 50%
Iterative training improves label quality
Method locates emergence times without retraining
Abstract
Machine learning (ML) is becoming a critical tool for interrogation of large complex data. Labeling, defined as the process of adding meaningful annotations, is a crucial step of supervised ML. However, labeling datasets is time consuming. Here we show that convolutional neural networks (CNNs), trained on crudely labeled astronomical videos, can be leveraged to improve the quality of data labeling and reduce the need for human intervention. We use videos of the solar magnetic field, crudely labeled into two classes: emergence or non-emergence of bipolar magnetic regions (BMRs), based on their first detection on the solar disk. We train CNNs using crude labels, manually verify, correct labeling vs. CNN disagreements, and repeat this process until convergence. Traditionally, flux emergence labelling is done manually. We find that a high-quality labeled dataset, derived through this…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
