Statistical Study of 2XMMi-DR3/SDSS-DR8 Cross-correlation Sample
Zhang Yan-Xia, Zhou Xin-Lin, Zhao Yong-Heng, Wu Xue-Bing

TL;DR
This study cross-correlates X-ray and optical catalogs to analyze the distribution of X-ray emitters, applying machine learning for classification, and highlights challenges in distinguishing X-ray active stars from quasars and galaxies.
Contribution
It provides a large X-ray/optical catalog and evaluates the effectiveness of a random forest classifier in identifying different X-ray source types.
Findings
Quasars and galaxies occupy distinct regions in photometric space.
Classification accuracy exceeds 93% for quasars and galaxies.
X-ray active stars are harder to distinguish, with only 45.3% accuracy.
Abstract
Cross-correlating the XMM-Newton 2XMMi-DR3 catalog with the Sloan Digital Sky Survey (SDSS) Data Release 8, we obtain one of the largest X-ray/optical catalogs and explore the distribution of various classes of X-ray emitters in the multidimensional photometric parameter space. Quasars and galaxies occupy different zones while stars scatter in them. However, X-ray active stars have a certain distributing rule according to spectral types. The earlier the type of stars, the stronger X-ray emitting. X-ray active stars have a similar distribution to most of stars in the g-r versus r-i diagram. Based on the identified samples with SDSS spectral classification, a random forest algorithm for automatic classification is performed. The result shows that the classification accuracy of quasars and galaxies adds up to more than 93.0% while that of X-ray emitting stars only amounts to 45.3%. In…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
