A sparse regression approach to modeling the relation between galaxy stellar masses and their host halos
M. Icaza-Lizaola, Richard G. Bower, Peder Norberg, Shaun Cole,, Matthieu Schaller, Stefan Egan

TL;DR
This paper employs sparse regression to develop an explainable model linking galaxy stellar mass to host halo properties, demonstrating high accuracy and revealing minimal influence of halo angular momentum.
Contribution
It introduces a sparse regression framework that effectively models galaxy stellar mass from halo data, emphasizing the role of halo mass evolution and discarding angular momentum.
Findings
Achieves RMSE of 0.167 in stellar mass predictions
Reproduces galaxy stellar mass function and correlation functions accurately
Finds halo angular momentum has little connection to galaxy formation efficiency
Abstract
Sparse regression algorithms have been proposed as the appropriate framework to model the governing equations of a system from data, without needing prior knowledge of the underlying physics. In this work, we use sparse regression to build an accurate and explainable model of the stellar mass of central galaxies given properties of their host dark matter (DM) halo. Our data set comprises 9,521 central galaxies from the EAGLE hydrodynamic simulation. By matching the host halos to a DM-only simulation, we collect the halo mass and specific angular momentum at present time and for their main progenitors in 10 redshift bins from to . The principal component of our governing equation is a third-order polynomial of the host halo mass, which models the stellar-mass halo-mass relation. The scatter about this relation is driven by the halo mass evolution and is captured by second and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsStatistical and numerical algorithms · Advanced Statistical Methods and Models · Statistical Methods and Inference
