Strong Gravitational Lensing Parameter Estimation with Vision Transformer
Kuan-Wei Huang, Geoff Chih-Fan Chen, Po-Wen Chang, Sheng-Chieh Lin,, Chia-Jung Hsu, Vishal Thengane, Joshua Yao-Yu Lin

TL;DR
This paper investigates the use of Vision Transformer models for parameter estimation in strong gravitational lensing, demonstrating competitive accuracy and efficiency compared to traditional CNN approaches, with implications for future large-scale surveys.
Contribution
First application of Vision Transformer to simulated strong gravitational lensing data, showing promising results and highlighting its potential for future lensing studies.
Findings
ViT achieves competitive results with CNNs in lens parameter estimation.
ViT performs well on key mass-related parameters such as lens center, ellipticities, and slope.
The approach offers a faster alternative to MCMC methods for large datasets.
Abstract
Quantifying the parameters and corresponding uncertainties of hundreds of strongly lensed quasar systems holds the key to resolving one of the most important scientific questions: the Hubble constant () tension. The commonly used Markov chain Monte Carlo (MCMC) method has been too time-consuming to achieve this goal, yet recent work has shown that convolution neural networks (CNNs) can be an alternative with seven orders of magnitude improvement in speed. With 31,200 simulated strongly lensed quasar images, we explore the usage of Vision Transformer (ViT) for simulated strong gravitational lensing for the first time. We show that ViT could reach competitive results compared with CNNs, and is specifically good at some lensing parameters, including the most important mass-related parameters such as the center of lens and , the ellipticities and ,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGalaxies: Formation, Evolution, Phenomena · Gamma-ray bursts and supernovae · Adaptive optics and wavefront sensing
MethodsAttention Is All You Need · Linear Layer · Position-Wise Feed-Forward Layer · Dropout · Softmax · Label Smoothing · Multi-Head Attention · Adam · Byte Pair Encoding · Absolute Position Encodings
