Probabilistic learning for pulsar classification

Sambatra Andrianomena

arXiv:2205.05765·astro-ph.IM·October 12, 2022

Probabilistic learning for pulsar classification

Sambatra Andrianomena

PDF

TL;DR

This paper investigates probabilistic learning methods, specifically Deep Gaussian Processes and Deep Kernel Learning, for pulsar candidate classification, demonstrating high accuracy and effective uncertainty calibration even with imbalanced datasets.

Contribution

It introduces the application of DGP and DKL models for pulsar classification, compares their confidence and calibration, and explores Bayesian Active Learning with CNNs for small datasets.

Findings

01

DGP and DKL achieve ROC-AUC around 0.98.

02

DKL shows better confidence and calibration than DGP.

03

CNN with Bayesian Active Learning performs well with limited data.

Abstract

In this work, we explore the possibility of using probabilistic learning to identify pulsar candidates. We make use of Deep Gaussian Process (DGP) and Deep Kernel Learning (DKL). Trained on a balanced training set in order to avoid the effect of class imbalance, the performance of the models, achieving relatively high probability of differentiating the positive class from the negative one ( $r oc$ - $a u c \sim 0.98$ ), is very promising overall. We estimate the predictive entropy of each model predictions and find that DKL is more confident than DGP in its predictions and provides better uncertainty calibration. Upon investigating the effect of training with imbalanced dataset on the models, results show that each model performance decreases with an increasing number of the majority class in the training set. Interestingly, with a number of negative class $10 \times$ that of positive class,…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.