Finding Quasars behind the Galactic Plane. IV. Candidate Selection from Chandra with Random Forest
Xu Zhang, Yanli Ai, Yanxia Zhang, Yuming Fu, Xue-Bing Wu, Zhiying Huo, Wenfeng Wen, Jiayuan Zhou, Dexuan Kong, Linfeng Zeng, Heng Wang

TL;DR
This study develops a machine learning framework using Chandra X-ray data, Gaia optical data, and mid-infrared data to identify quasar candidates behind the Galactic plane, significantly expanding the known sample.
Contribution
It introduces a novel Random Forest-based method combining multi-wavelength data to efficiently select low-latitude quasar candidates, including many previously undetected.
Findings
Identified 7570 quasar candidates behind the Galactic plane.
Discovered 1060 high-confidence Galactic Plane Quasar candidates.
Confirmed two candidates as quasars through spectroscopy.
Abstract
Quasar samples remain severely incomplete at low Galactic latitudes because of strong extinction and source confusion. We conduct a systematic search for quasars behind the Galactic plane using X-ray sources from the Chandra Source Catalog (CSC 2.1), combined with optical data from Gaia DR3 and mid-infrared data from CatWISE2020. Using spectroscopically confirmed quasars and stellar-type objects from data sets including DESI, SDSS, and LAMOST, we apply a Random Forest classifier to identify quasar candidates, with stellar contaminants suppressed using Gaia proper-motion constraints. Photometric redshifts are estimated for the candidates using a Random Forest regression model. Applying this framework to previously unclassified CSC sources, we identify 7570 quasar candidates, including 1060 Galactic Plane Quasar (GPQ) candidates at |b|<20{\deg}, of which 551 are high-confidence…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
