TL;DR
This paper introduces a semi-supervised GAN approach for pulsar candidate identification that significantly improves classification accuracy with limited labeled data, aiding early-stage surveys and benchmarking.
Contribution
The study presents a novel semi-supervised GAN method that outperforms standard supervised algorithms in pulsar classification with minimal labeled data.
Findings
Achieved 94.9% accuracy with only 100 labeled and 5000 unlabeled candidates.
Final model reached 99.2% accuracy and 99.7% recall on larger labeled dataset.
Open-sourced a new large pulsar candidate dataset for benchmarking.
Abstract
Machine learning methods are increasingly helping astronomers identify new radio pulsars. However, they require a large amount of labelled data, which is time consuming to produce and biased. Here we describe a Semi-Supervised Generative Adversarial Network (SGAN) which achieves better classification performance than the standard supervised algorithms using majority unlabelled datasets. We achieved an accuracy and mean F-Score of 94.9% trained on only 100 labelled candidates and 5000 unlabelled candidates compared to our standard supervised baseline which scored at 81.1% and 82.7% respectively. Our final model trained on a much larger labelled dataset achieved an accuracy and mean F-score value of 99.2% and a recall rate of 99.7%. This technique allows for high quality classification during the early stages of pulsar surveys on new instruments when limited labelled data is available. We…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
