Enhancing the reliability of machine learning for gravitational wave parameter estimation with attention-based models

Hibiki Iwanaga; Mahoro Matsuyama; Yousuke Itoh

arXiv:2501.10486·astro-ph.IM·October 15, 2025

Enhancing the reliability of machine learning for gravitational wave parameter estimation with attention-based models

Hibiki Iwanaga, Mahoro Matsuyama, Yousuke Itoh

PDF

Open Access

TL;DR

This paper presents attention-based models using Vision Transformers to improve the reliability of gravitational wave parameter estimation, providing interpretability and glitch impact analysis.

Contribution

Introduces attention map techniques in machine learning models for gravitational wave analysis to enhance interpretability and reliability.

Findings

01

Models focus on physically meaningful features

02

Attention maps reveal bias from glitches

03

Potential to distinguish reliable from unreliable estimates

Abstract

We introduce a technique to enhance the reliability of gravitational wave parameter estimation results produced by machine learning. We develop two independent machine learning models based on the Vision Transformer to estimate effective spin and chirp mass from spectrograms of gravitational wave signals from binary black hole mergers. To enhance the reliability of these models, we utilize attention maps to visualize the areas our models focus on when making predictions. This approach enables demonstrating that both models perform parameter estimation based on physically meaningful information. Furthermore, by leveraging these attention maps, we demonstrate a method to quantify the impact of glitches on parameter estimation. We show that as the models focus more on glitches, the parameter estimation results become more strongly biased. This suggests that attention maps could potentially…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsComputational Physics and Python Applications · Geophysics and Gravity Measurements · Meteorological Phenomena and Simulations

MethodsAttention Is All You Need · Softmax · Residual Connection · Dropout · Absolute Position Encodings · Byte Pair Encoding · Linear Layer · Vision Transformer · Multi-Head Attention · Position-Wise Feed-Forward Layer