TL;DR
This paper introduces SHEEP, a machine learning pipeline that improves astronomical source classification by integrating photometric redshift estimation with ensemble classifiers, achieving high accuracy in distinguishing stars, galaxies, and quasars.
Contribution
The novel integration of photometric redshift estimation as a feature in ensemble learning significantly enhances classification performance in astronomy.
Findings
F1-score of 0.992 for galaxies
F1-score of 0.967 for quasars
Outperforms recent RandomForest-based methods
Abstract
We present SHEEP, a new machine learning approach to the classic problem of astronomical source classification, which combines the outputs from the XGBoost, LightGBM, and CatBoost learning algorithms to create stronger classifiers. A novel step in our pipeline is that prior to performing the classification, SHEEP first estimates photometric redshifts, which are then placed into the data set as an additional feature for classification model training; this results in significant improvements in the subsequent classification performance. SHEEP contains two distinct classification methodologies: (i) Multi-class and (ii) one versus all with correction by a meta-learner. We demonstrate the performance of SHEEP for the classification of stars, galaxies, and quasars using a data set composed of SDSS and WISE photometry of 3.5 million astronomical sources. The resulting F1-scores are as follows:…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
