Radio Galaxy Zoo: Using semi-supervised learning to leverage large unlabelled data-sets for radio galaxy classification under data-set shift
Inigo V. Slijepcevic, Anna M. M. Scaife, Mike Walmsley, Micah Bowles,, Ivy Wong, Stanislav S. Shabala, Hongming Tang

TL;DR
This study evaluates semi-supervised learning for radio galaxy classification, revealing its benefits, limitations under dataset shift, and the challenges in performance prediction using data-set shift measures.
Contribution
It demonstrates the performance and calibration limitations of SSL in radio galaxy classification, especially under dataset shift and class imbalance, and explores data-set shift measurement techniques.
Findings
SSL outperforms baseline accuracy within a narrow label volume range
SSL does not improve model calibration regardless of accuracy gains
Data-set shift significantly reduces SSL performance when training and unlabeled data differ
Abstract
In this work we examine the classification accuracy and robustness of a state-of-the-art semi-supervised learning (SSL) algorithm applied to the morphological classification of radio galaxies. We test if SSL with fewer labels can achieve test accuracies comparable to the supervised state-of-the-art and whether this holds when incorporating previously unseen data. We find that for the radio galaxy classification problem considered, SSL provides additional regularisation and outperforms the baseline test accuracy. However, in contrast to model performance metrics reported on computer science benchmarking data-sets, we find that improvement is limited to a narrow range of label volumes, with performance falling off rapidly at low label volumes. Additionally, we show that SSL does not improve model calibration, regardless of whether classification is improved. Moreover, we find that when…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
