Classification and Ranking of Fermi LAT Gamma-ray Sources from the 3FGL   Catalog using Machine Learning Techniques

P. M. Saz Parkinson (HKU/LSR; SCIPP); H. Xu (HKU); P. L. H. Yu (HKU),; D. Salvetti (INAF); M. Marelli (INAF); A. D. Falcone (Penn State)

arXiv:1602.00385·astro-ph.HE·March 23, 2016

Classification and Ranking of Fermi LAT Gamma-ray Sources from the 3FGL Catalog using Machine Learning Techniques

P. M. Saz Parkinson (HKU/LSR, SCIPP), H. Xu (HKU), P. L. H. Yu (HKU),, D. Salvetti (INAF), M. Marelli (INAF), A. D. Falcone (Penn State)

PDF

TL;DR

This study employs machine learning techniques, especially Random Forest, to classify gamma-ray sources from the 3FGL catalog as pulsars or AGN with over 96% accuracy, and further distinguishes pulsar subtypes, aiding follow-up and population analysis.

Contribution

The paper introduces the application of machine learning algorithms, notably Random Forest, for high-accuracy classification and ranking of gamma-ray sources and pulsar subtypes in the 3FGL catalog, with validation on known sources.

Findings

01

Random Forest achieves >96% accuracy in classifying AGN and pulsars.

02

The models accurately distinguish young and millisecond pulsars with ~90% accuracy.

03

Predictions assist in identifying unassociated sources and potential X-ray counterparts.

Abstract

We apply a number of statistical and machine learning techniques to classify and rank gamma-ray sources from the Third Fermi Large Area Telescope (LAT) Source Catalog (3FGL), according to their likelihood of falling into the two major classes of gamma-ray emitters: pulsars (PSR) or Active Galactic Nuclei (AGN). Using 1904 3FGL sources that have been identified/associated with AGN (1738) and PSR (166), we train (using 70% of our sample) and test (using 30%) our algorithms and find that the best overall accuracy (>96%) is obtained with the Random Forest (RF) technique, while using a logistic regression (LR) algorithm results in only marginally lower accuracy. We apply the same techniques on a sub-sample of 142 known gamma-ray pulsars to classify them into two major subcategories: young (YNG) and millisecond pulsars (MSP). Once more, the RF algorithm has the best overall accuracy (~90%),…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.