Attention-gating for improved radio galaxy classification

Micah Bowles; Anna M. M. Scaife; Fiona Porter; Hongming Tang; David J.; Bastien

arXiv:2012.01248·astro-ph.GA·February 2, 2021

Attention-gating for improved radio galaxy classification

Micah Bowles, Anna M. M. Scaife, Fiona Porter, Hongming Tang, David J., Bastien

PDF

2 Repos

TL;DR

This paper introduces an attention-based convolutional neural network for radio galaxy classification that achieves comparable accuracy to existing models but with fewer parameters, and offers interpretability aligned with expert human classification.

Contribution

The study presents a novel attention-gating mechanism for CNNs in radio galaxy classification, improving interpretability and reducing model complexity.

Findings

01

Model performs on par with previous classifiers.

02

Uses over 50% fewer parameters than comparable CNNs.

03

Attention maps align with expert human classification regions.

Abstract

In this work we introduce attention as a state of the art mechanism for classification of radio galaxies using convolutional neural networks. We present an attention-based model that performs on par with previous classifiers while using more than 50% fewer parameters than the next smallest classic CNN application in this field. We demonstrate quantitatively how the selection of normalisation and aggregation methods used in attention-gating can affect the output of individual models, and show that the resulting attention maps can be used to interpret the classification choices made by the model. We observe that the salient regions identified by the our model align well with the regions an expert human classifier would attend to make equivalent classifications. We show that while the selection of normalisation and aggregation may only minimally affect the performance of individual models,…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

MethodsInterpretability