Mining the information content of member galaxies in the halo mass modelling
Yanrui Zhou, Jiaxin Han

TL;DR
This study uses machine learning on simulation data to identify the most informative member galaxies for accurately estimating halo mass, revealing optimal galaxy combinations and their physical significance.
Contribution
It systematically analyzes the information content of different galaxy members in halo mass modeling using feature selection and machine learning, highlighting optimal galaxy combinations.
Findings
Adding satellites improves halo mass estimates, but the benefit plateaus after a few galaxies.
The best prediction uses the central galaxy plus the most and least massive satellites.
Two or three galaxies suffice for near-optimal halo mass estimation.
Abstract
Motivated by previous findings that the magnitude gap between certain satellite galaxy and the central galaxy can be used to improve the estimation of halo mass, we carry out a systematic study of the information content of different member galaxies in the modelling of the host halo mass using a machine learning approach. We employ data from the hydrodynamical simulation IllustrisTNG and train a Random Forest (RF) algorithm to predict a halo mass from the stellar masses of its member galaxies. Exhaustive feature selection is adopted to disentangle the importances of different galaxy members. We confirm that an additional satellite does improve the halo mass estimation compared to that estimated by the central alone. However, the magnitude of this improvement does not differ significantly using different satellite galaxies. When three galaxies are used in the halo mass prediction, the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGalaxies: Formation, Evolution, Phenomena
