A machine learning approach to galaxy properties: joint redshift-stellar mass probability distributions with Random Forest
S. Mucesh, W. G. Hartley, A. Palmese, O. Lahav, L. Whiteway, A. F. L., Bluck, A. Alarcon, A. Amon, K. Bechtol, G. M. Bernstein, A. Carnero Rosell,, M. Carrasco Kind, A. Choi, K. Eckert, S. Everett, D. Gruen, R. A. Gruendl, I., Harrison, E. M. Huff, N. Kuropatkin

TL;DR
This paper presents a machine learning method using Random Forests to accurately and rapidly generate joint redshift-stellar mass probability distributions for galaxies, outperforming traditional template-fitting techniques.
Contribution
The authors develop a novel ML approach with a Python package that produces high-quality joint PDFs efficiently, even with limited photometric data, advancing galaxy property estimation methods.
Findings
RF-based PDFs outperform template fitting in accuracy.
Method is extremely fast, processing a million galaxies in under 6 minutes.
Developed GALPRO, an accessible Python tool for on-the-fly PDF generation.
Abstract
We demonstrate that highly accurate joint redshift-stellar mass probability distribution functions (PDFs) can be obtained using the Random Forest (RF) machine learning (ML) algorithm, even with few photometric bands available. As an example, we use the Dark Energy Survey (DES), combined with the COSMOS2015 catalogue for redshifts and stellar masses. We build two ML models: one containing deep photometry in the bands, and the second reflecting the photometric scatter present in the main DES survey, with carefully constructed representative training data in each case. We validate our joint PDFs for test galaxies by utilizing the copula probability integral transform and the Kendall distribution function, and their univariate counterparts to validate the marginals. Benchmarked against a basic set-up of the template-fitting code BAGPIPES, our ML-based method outperforms…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
