TL;DR
This paper presents a machine learning framework that classifies solar activity from compressed Mg II spectra data, achieving high accuracy and enabling efficient analysis of large solar datasets.
Contribution
It introduces a novel approach to classify solar activity directly on compressed data, reducing computational and bandwidth requirements while maintaining high classification accuracy.
Findings
XGBoost achieved over 95% accuracy on compressed data.
Classification performance on compressed data is comparable to uncompressed data.
The method enables efficient solar activity classification with reduced data complexity.
Abstract
Although large volumes of solar data are available for study, the vast majority of these data remain unlabeled and are therefore not amenable to supervised machine learning methods. Having a way to accurately and automatically classify spectra into categories related to solar activity is highly desirable and will assist and speed up future research efforts in solar physics. At the same time, the large volume of raw observational data is a serious bottleneck for machine learning, requiring powerful computational means that are not at the disposal of many laboratories. Besides, the raw data communication imposes restrictions on real time data observations and requires considerable bandwidth and energy for the onboard solar observation systems. To solve these issues, we propose a framework to classify solar activity on compressed data. For this, we used a labeling scheme from a…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
