# Benchmarks and Explanations for Deep Learning Estimates of X-ray Galaxy   Cluster Masses

**Authors:** Matthew Ho, John Soltis, Arya Farahi, Daisuke Nagai, August Evrard,, Michelle Ntampaka

arXiv: 2303.00005 · 2023-07-27

## TL;DR

This study evaluates deep learning models for estimating galaxy cluster masses from X-ray data, demonstrating significant improvements over traditional methods and providing interpretability insights into model focus areas.

## Contribution

The paper introduces deep learning approaches that improve galaxy cluster mass estimates from X-ray data and offers interpretability analysis of model decision regions.

## Key findings

- Deep learning reduces mass scatter to 17.8%, doubling accuracy of scalar observables.
- Multichannel X-ray input further decreases scatter to 16.2%.
- Incorporating dynamical and X-ray probes achieves 15.9% scatter.

## Abstract

We evaluate the effectiveness of deep learning (DL) models for reconstructing the masses of galaxy clusters using X-ray photometry data from next-generation surveys. We establish these constraints using a catalogue of realistic mock eROSITA X-ray observations which use hydrodynamical simulations to model realistic cluster morphology, background emission, telescope response, and AGN sources. Using bolometric X-ray photon maps as input, DL models achieve a predictive mass scatter of $\sigma_{\ln M_\mathrm{500c}} = 17.8\%$, a factor of two improvements on scalar observables such as richness $N_\mathrm{gal}$, 1D velocity dispersion $\sigma_\mathrm{v,1D}$, and photon count $N_\mathrm{phot}$ as well as a $32\%$ improvement upon idealised, volume-integrated measurements of the bolometric X-ray luminosity $L_X$. We then show that extending this model to handle multichannel X-ray photon maps, separated in low, medium, and high energy bands, further reduces the mass scatter to $16.2\%$. We also tested a multimodal DL model incorporating both dynamical and X-ray cluster probes and achieved marginal gains at a mass scatter of $15.9\%$. Finally, we conduct a quantitative interpretability study of our DL models and find that they greatly down-weight the importance of pixels in the centres of clusters and at the location of AGN sources, validating previous claims of DL modelling improvements and suggesting practical and theoretical benefits for using DL in X-ray mass inference.

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/2303.00005/full.md

## Figures

9 figures with captions in the complete paper: https://tomesphere.com/paper/2303.00005/full.md

## References

86 references — full list in the complete paper: https://tomesphere.com/paper/2303.00005/full.md

---
Source: https://tomesphere.com/paper/2303.00005