Finding Strong Gravitational Lenses Through Self-Attention
Hareesh Thuruthipilly, Adam Zadrozny, Agnieszka Pollo, Marek, Biesiada

TL;DR
This paper introduces a self-attention based neural network architecture for automatically identifying strong gravitational lenses in large astronomical survey data, outperforming traditional CNNs in accuracy and confidence.
Contribution
The study develops and evaluates self-attention encoder models, demonstrating their superior performance over CNNs in gravitational lens detection tasks.
Findings
Self-attention models outperform CNNs in TPR metrics.
Self-attention models can identify highly confident lens candidates.
Encoder layers help reduce overfitting in models.
Abstract
The upcoming large scale surveys like LSST are expected to find approximately strong gravitational lenses by analysing data of many orders of magnitude larger than those in contemporary astronomical surveys. In this case, non-automated techniques will be highly challenging and time-consuming, even if they are possible at all. We propose a new automated architecture based on the principle of self-attention to find strong gravitational lenses. The advantages of self-attention-based encoder models over convolution neural networks are investigated, and ways to optimise the outcome of encoder models are analysed. We constructed and trained 21 self-attention based encoder models and five convolution neural networks to identify gravitational lenses from the Bologna Lens Challenge. Each model was trained separately using 18,000 simulated images, cross-validated using 2,000 images, and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
MethodsTest · Convolution
