StratLearn-z: Improved photo-$z$ estimation from spectroscopic data subject to selection effects
Chiara Moretti, Maximilian Autenrieth, Riccardo Serra, Roberto Trotta, David A. van Dyk, Andrei Mesinger

TL;DR
This paper introduces StratLearn-z, a stratification-based machine learning method that improves photometric redshift estimation under selection bias, demonstrating robustness and reduced errors compared to existing algorithms.
Contribution
The paper presents a novel stratification approach for photo-z estimation that mitigates covariate shift effects, enhancing accuracy and reducing catastrophic errors in biased datasets.
Findings
StratLearn-z is less affected by selection bias than GPz.
Significant reduction in RMSE and bias with StratLearn-z under strong covariate shift.
Improved probability integral transform (PIT) distribution indicating better uncertainty calibration.
Abstract
A precise measurement of photometric redshifts (photo-z) is key for the success of modern photometric galaxy surveys. Machine learning (ML) methods show great promise in this context, but suffer from covariate shift (CS) in training sets due to selection bias where interesting sources are underrepresented, and the corresponding ML models show poor generalisation properties. We present an application of the StratLearn method to the estimation of photo-z, validating against simulations where we enforce the presence of CS to different degrees. StratLearn is a statistically principled approach that relies on splitting the source and target datasets into strata based on estimated propensity scores (i.e. the probability for an object to be in the source set given its observed covariates). After stratification, two conditional density estimators are fit separately to each stratum, then…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSpectroscopy Techniques in Biomedical and Chemical Research · Spectroscopy and Chemometric Analyses · Fault Detection and Control Systems
