The VMC Survey : LI. Classifying extragalactic sources using a probabilistic random forest supervised machine learning algorithm
Clara M. Pennock, Jacco Th. van Loon, Maria-Rosa L. Cioni, Chandreyee, Maitra, Joana M. Oliveira, Jessica E. M. Craig, Valentin D. Ivanov, James, Aird, Joy. O. Anih, Nicholas J. G. Cross, Francesca Dresbach, Richard de, Grijs, Martin A. T. Groenewegen

TL;DR
This study applies a probabilistic random forest machine learning algorithm to classify nearly 130 million sources in the VISTA Survey of the Magellanic Clouds, achieving high accuracy and discovering thousands of new extragalactic and stellar candidates.
Contribution
The paper introduces a novel application of probabilistic random forest for large-scale classification of astronomical sources using multi-wavelength data, including new spectroscopic observations.
Findings
Achieved up to 98% classification accuracy for LMC sources.
Discovered over 49,500 new AGN candidates and 26,500 new galaxy candidates.
Classified thousands of sources as extragalactic, stellar, or unknown, with validation against independent datasets.
Abstract
We used a supervised machine learning algorithm (probabilistic random forest) to classify ~130 million sources in the VISTA Survey of the Magellanic Clouds (VMC). We used multi-wavelength photometry from optical to far-infrared as features to be trained on, and spectra of Active Galactic Nuclei (AGN), galaxies and a range of stellar classes including from new observations with the Southern African Large Telescope (SALT) and SAAO 1.9m telescope. We also retain a label for sources that remain unknown. This yielded average classifier accuracies of ~79% (SMC) and ~87% (LMC). Restricting to the 56,696,719 sources with class probabilities (P) > 80% yields accuracies of ~90% (SMC) and ~98% (LMC). After removing sources classed as 'Unknown', we classify a total of 707,939 (SMC) and 397,899 (LMC) sources, including > 77,600 extragalactic sources behind the Magellanic Clouds. The…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAstronomical Observations and Instrumentation · Astrophysics and Cosmic Phenomena
