Automated classification of Chandra X-ray point sources using machine learning methods
Shivam Kumaran (IIST, India), Samir Mandal (IIST, India), Sudip, Bhattacharyya (TIFR, India), Deepak Mishra (IIST, India)

TL;DR
This paper develops a machine learning classifier to automatically identify various types of X-ray point sources in the Chandra Source Catalogue using multi-wavelength data, achieving over 93% accuracy.
Contribution
The study introduces a Light Gradient Boosted Machine classifier trained on multi-wavelength features, enabling high-precision identification of diverse X-ray sources in astronomical catalogues.
Findings
Achieved 93% precision and recall in classification.
Identified over 54,000 sources with high confidence.
Demonstrated applicability to other astronomical catalogues.
Abstract
A large number of unidentified sources found by astronomical surveys and other observations necessitate the use of an automated classification technique based on machine learning methods. The aim of this paper is to find a suitable automated classifier to identify the point X-ray sources in the Chandra Source Catalogue (CSC) 2.0 in the categories of active galactic nuclei (AGN), X-ray emitting stars, young stellar objects (YSOs), high-mass X-ray binaries (HMXBs), low-mass X-ray binaries (LMXBs), ultra luminous X-ray sources (ULXs), cataclysmic variables (CVs), and pulsars. The catalogue consists of approx 3,17,000 sources, out of which we select 2,77,069 point sources based on the quality flags available in CSC 2.0. In order to identify unknown sources of CSC 2.0, we use multi-wavelength features, such as magnitudes in optical/UV bands from Gaia-EDR3, SDSS and GALEX, and magnitudes in…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
