A machine learning approach to infer the accreted stellar mass fractions of central galaxies in the TNG100 simulation
Rui Shi, Wenting Wang, Zhaozhou Li, Jiaxin Han, Jingjing Shi, Vicente, Rodriguez-Gomez, Yingjie Peng, Qingyang Li

TL;DR
This study uses a random forest machine learning method to predict the accreted stellar mass fractions of central galaxies in the TNG100 simulation, highlighting key features and limitations of current observational data.
Contribution
It introduces a machine learning framework for estimating accreted stellar mass fractions using galaxy and halo features, with insights into feature importance and observational constraints.
Findings
Prediction RMSE is ~0.068 with complete features.
Observable features yield a prediction RMSE of ~0.104.
Galaxy size, merger history, and morphology are key predictive features.
Abstract
We propose a random forest (RF) machine learning approach to determine the accreted stellar mass fractions () of central galaxies, based on various dark matter halo and galaxy features. The RF is trained and tested using 2,710 galaxies with stellar mass from the TNG100 simulation. Galaxy size is the most important individual feature when calculated in 3-dimensions, which becomes less important after accounting for observational effects. For smaller galaxies, the rankings for features related to merger histories increase. When an entire set of halo and galaxy features are used, the prediction is almost unbiased, with root-mean-square error (RMSE) of 0.068. A combination of up to three features with different types (galaxy size, merger history and morphology) already saturates the power of prediction. If using observable features, the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
