Machine-learning identification of galaxies in the WISExSuperCOSMOS all-sky catalogue
T. Krakowski, K. Ma{\l}ek, M. Bilicki, A. Pollo, M. Krupa, A. Kurcz

TL;DR
This paper presents a machine learning approach using support vector machines to identify galaxies in the WISExSuperCOSMOS all-sky catalogue, achieving higher purity and reliability than traditional colour cut methods.
Contribution
It introduces an SVM-based classification method for galaxy identification in large sky surveys, improving over previous colour cut techniques.
Findings
Classifier accuracy >95% for bright sources
Purity of galaxy sample exceeds previous methods
Effective classification across different Galactic latitudes
Abstract
The two currently largest all-sky photometric datasets, WISE and SuperCOSMOS, were cross-matched by Bilicki et al. (2016) (B16) to construct a novel photometric redshift catalogue on 70% of the sky. Galaxies were therein separated from stars and quasars through colour cuts, which may leave imperfections because of mixing different source types which overlap in colour space. The aim of the present work is to identify galaxies in the WISExSuperCOSMOS catalogue through an alternative approach of machine learning. This allows us to define more complex separations in the multi-colour space than possible with simple colour cuts, and should provide more reliable source classification. For the automatised classification we use the support vector machines learning algorithm, employing SDSS spectroscopic sources cross-matched with WISExSuperCOSMOS as the training and verification set. We perform…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
