Radio Galaxy Zoo: Using semi-supervised learning to leverage large unlabelled data-sets for radio galaxy classification under data-set shift

Inigo V. Slijepcevic; Anna M. M. Scaife; Mike Walmsley; Micah Bowles; Ivy Wong; Stanislav S. Shabala; Hongming Tang

arXiv:2204.08816·astro-ph.GA·May 11, 2022

TL;DR

This study evaluates semi-supervised learning (SSL) for radio galaxy classification, showing where it helps, how it is limited under data-set shift, and that predicting performance from data-set shift measures is challenging.

Contribution

The paper demonstrates the performance and calibration limitations of SSL in radio galaxy classification, especially under data-set shift and class imbalance, and explores techniques for measuring data-set shift.

Findings

01

SSL outperforms baseline accuracy within a narrow label volume range

02

SSL does not improve model calibration regardless of accuracy gains

03

Data-set shift significantly reduces SSL performance when training and unlabelled data differ

Abstract

In this work we examine the classification accuracy and robustness of a state-of-the-art semi-supervised learning (SSL) algorithm applied to the morphological classification of radio galaxies. We test if SSL with fewer labels can achieve test accuracies comparable to the supervised state-of-the-art and whether this holds when incorporating previously unseen data. We find that for the radio galaxy classification problem considered, SSL provides additional regularisation and outperforms the baseline test accuracy. However, in contrast to model performance metrics reported on computer science benchmarking data-sets, we find that improvement is limited to a narrow range of label volumes, with performance falling off rapidly at low label volumes. Additionally, we show that SSL does not improve model calibration, regardless of whether classification is improved. Moreover, we find that when…
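The abstract does not say which calibration metric the authors use. As an illustration only, the sketch below computes the expected calibration error (ECE), a common way to quantify calibration: predictions are grouped into confidence bins, and the gap between mean confidence and accuracy in each bin is averaged, weighted by bin size. The function name, the choice of 10 equal-width bins, and the input format are assumptions made for this example and are not taken from the paper or its repository.

```python
def expected_calibration_error(confidences, correct, n_bins=10):
    """Illustrative ECE: size-weighted mean |accuracy - confidence| over bins.

    confidences: predicted probability of the chosen class, each in [0, 1].
    correct:     booleans, True where the prediction matched the label.
    n_bins:      number of equal-width confidence bins (assumed value).
    """
    if len(confidences) != len(correct):
        raise ValueError("confidences and correct must have the same length")
    n = len(confidences)
    if n == 0:
        raise ValueError("at least one prediction is required")

    # Assign each prediction to a bin [k/n_bins, (k+1)/n_bins);
    # a confidence of exactly 1.0 goes into the last bin.
    bins = [[] for _ in range(n_bins)]
    for conf, hit in zip(confidences, correct):
        idx = min(int(conf * n_bins), n_bins - 1)
        bins[idx].append((conf, hit))

    ece = 0.0
    for members in bins:
        if not members:
            continue  # empty bins contribute nothing
        mean_conf = sum(c for c, _ in members) / len(members)
        accuracy = sum(1 for _, h in members if h) / len(members)
        ece += (len(members) / n) * abs(accuracy - mean_conf)
    return ece
```

A well-calibrated model has an ECE near zero. A model can gain accuracy while its ECE stays the same or worsens, which is the kind of outcome the abstract reports for SSL.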

Code & Models

Repositories

inigoval/fixmatch
PyTorch · Official