# On the realistic validation of photometric redshifts, or why Teddy will   never be Happy

**Authors:** R. Beck, C.-A. Lin, E. E. O. Ishida, F. Gieseke, R. S. de Souza, M. V., Costa-Duarte, M. W. Hattab, A. Krone-Martins (for the COIN Collaboration)

arXiv: 1701.08748 · 2019-08-13

## TL;DR

This paper introduces realistic galaxy catalogues to rigorously test photometric redshift methods, revealing their limitations under real-world data biases and errors, and highlighting the need for improved models in future surveys.

## Contribution

The work provides the first controlled environment with galaxy catalogues that mimic real biases, enabling systematic evaluation of photo-z methods under realistic conditions.

## Key findings

- ML algorithms perform poorly with missing feature coverage
- Global models are more robust in incomplete data scenarios
- Photometric errors significantly impact photo-z accuracy

## Abstract

Two of the main problems encountered in the development and accurate validation of photometric redshift (photo-z) techniques are the lack of spectroscopic coverage in feature space (e.g. colours and magnitudes) and the mismatch between photometric error distributions associated with the spectroscopic and photometric samples. Although these issues are well known, there is currently no standard benchmark allowing a quantitative analysis of their impact on the final photo-z estimation. In this work, we present two galaxy catalogues, Teddy and Happy, built to enable a more demanding and realistic test of photo-z methods. Using photometry from the Sloan Digital Sky Survey and spectroscopy from a collection of sources, we constructed datasets which mimic the biases between the underlying probability distribution of the real spectroscopic and photometric sample. We demonstrate the potential of these catalogues by submitting them to the scrutiny of different photo-z methods, including machine learning (ML) and template fitting approaches. Beyond the expected bad results from most ML algorithms for cases with missing coverage in feature space, we were able to recognize the superiority of global models in the same situation and the general failure across all types of methods when incomplete coverage is convoluted with the presence of photometric errors - a data situation which photo-z methods were not trained to deal with up to now and which must be addressed by future large scale surveys. Our catalogues represent the first controlled environment allowing a straightforward implementation of such tests. The data are publicly available within the COINtoolbox (https://github.com/COINtoolbox/photoz_catalogues).

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/1701.08748/full.md

## Figures

14 figures with captions in the complete paper: https://tomesphere.com/paper/1701.08748/full.md

## References

89 references — full list in the complete paper: https://tomesphere.com/paper/1701.08748/full.md

---
Source: https://tomesphere.com/paper/1701.08748