Predicting dark matter halo masses from simulated galaxy images and environments
Austin J. Larson, John F. Wu, Craig Jones

TL;DR
This study evaluates machine learning models, including CNNs and GNNs, trained on simulated galaxy data to improve the prediction of dark matter halo masses beyond traditional methods.
Contribution
It introduces a combined CNN+GNN approach and demonstrates that deep learning models can utilize galaxy images and environments to better estimate halo masses.
Findings
GNNs outperform baseline models in halo mass prediction
CNNs are limited by small dataset size but show potential
Combined CNN+GNN achieves the lowest error metrics
Abstract
Galaxies are theorized to form and co-evolve with their dark matter halos, such that their stellar masses and halo masses should be well-correlated. However, it is not known whether other observable galaxy features, such as their morphologies or large-scale environments, can be used to tighten the correlation between galaxy properties and halo masses. In this work, we train a baseline random forest model to predict halo mass using galaxy features from the Illustris TNG50 hydrodynamical simulation, and compare with convolutional neural networks (CNNs) and graph neural networks (GNNs) trained respectively using galaxy image cutouts and galaxy point clouds. The best baseline model has a root mean squared error (RMSE) of 0.310 and mean absolute error (MAE) of 0.220, compared to the CNN (RSME=0.359, MAE=0.238), GNN (RMSE=0.248, MAE=0.158), and a novel combined CNN+GNN (RMSE=0.248,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsCCD and CMOS Imaging Sensors · Astronomy and Astrophysical Research · Galaxies: Formation, Evolution, Phenomena
