Strong Gravitational Lensing Parameter Estimation with Vision   Transformer

Kuan-Wei Huang; Geoff Chih-Fan Chen; Po-Wen Chang; Sheng-Chieh Lin,; Chia-Jung Hsu; Vishal Thengane; Joshua Yao-Yu Lin

arXiv:2210.04143·astro-ph.CO·October 11, 2022·5 cites

Strong Gravitational Lensing Parameter Estimation with Vision Transformer

Kuan-Wei Huang, Geoff Chih-Fan Chen, Po-Wen Chang, Sheng-Chieh Lin,, Chia-Jung Hsu, Vishal Thengane, Joshua Yao-Yu Lin

PDF

Open Access 1 Repo

TL;DR

This paper investigates the use of Vision Transformer models for parameter estimation in strong gravitational lensing, demonstrating competitive accuracy and efficiency compared to traditional CNN approaches, with implications for future large-scale surveys.

Contribution

First application of Vision Transformer to simulated strong gravitational lensing data, showing promising results and highlighting its potential for future lensing studies.

Findings

01

ViT achieves competitive results with CNNs in lens parameter estimation.

02

ViT performs well on key mass-related parameters such as lens center, ellipticities, and slope.

03

The approach offers a faster alternative to MCMC methods for large datasets.

Abstract

Quantifying the parameters and corresponding uncertainties of hundreds of strongly lensed quasar systems holds the key to resolving one of the most important scientific questions: the Hubble constant ( $H_{0}$ ) tension. The commonly used Markov chain Monte Carlo (MCMC) method has been too time-consuming to achieve this goal, yet recent work has shown that convolution neural networks (CNNs) can be an alternative with seven orders of magnitude improvement in speed. With 31,200 simulated strongly lensed quasar images, we explore the usage of Vision Transformer (ViT) for simulated strong gravitational lensing for the first time. We show that ViT could reach competitive results compared with CNNs, and is specifically good at some lensing parameters, including the most important mass-related parameters such as the center of lens $θ_{1}$ and $θ_{2}$ , the ellipticities $e_{1}$ and $e_{2}$ ,…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

kuanweih/strong_lensing_vit_resnet
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsGalaxies: Formation, Evolution, Phenomena · Gamma-ray bursts and supernovae · Adaptive optics and wavefront sensing

MethodsAttention Is All You Need · Linear Layer · Position-Wise Feed-Forward Layer · Dropout · Softmax · Label Smoothing · Multi-Head Attention · Adam · Byte Pair Encoding · Absolute Position Encodings