Finding Strong Gravitational Lenses Through Self-Attention

Hareesh Thuruthipilly; Adam Zadrozny; Agnieszka Pollo; Marek; Biesiada

arXiv:2110.09202·cs.CV·December 27, 2022

Finding Strong Gravitational Lenses Through Self-Attention

Hareesh Thuruthipilly, Adam Zadrozny, Agnieszka Pollo, Marek, Biesiada

PDF

TL;DR

This paper introduces a self-attention based neural network architecture for automatically identifying strong gravitational lenses in large astronomical survey data, outperforming traditional CNNs in accuracy and confidence.

Contribution

The study develops and evaluates self-attention encoder models, demonstrating their superior performance over CNNs in gravitational lens detection tasks.

Findings

01

Self-attention models outperform CNNs in TPR metrics.

02

Self-attention models can identify highly confident lens candidates.

03

Encoder layers help reduce overfitting in models.

Abstract

The upcoming large scale surveys like LSST are expected to find approximately $1 0^{5}$ strong gravitational lenses by analysing data of many orders of magnitude larger than those in contemporary astronomical surveys. In this case, non-automated techniques will be highly challenging and time-consuming, even if they are possible at all. We propose a new automated architecture based on the principle of self-attention to find strong gravitational lenses. The advantages of self-attention-based encoder models over convolution neural networks are investigated, and ways to optimise the outcome of encoder models are analysed. We constructed and trained 21 self-attention based encoder models and five convolution neural networks to identify gravitational lenses from the Bologna Lens Challenge. Each model was trained separately using 18,000 simulated images, cross-validated using 2,000 images, and…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

MethodsTest · Convolution