Robust Machine Learning Applied to Astronomical Datasets III: Probabilistic Photometric Redshifts for Galaxies and Quasars in the SDSS and GALEX
Nicholas M. Ball (1), Robert J. Brunner (1, 2), Adam D. Myers (1),, Natalie E. Strand (3), Stacey L. Alberts (1), David Tcheng (2) ((1), Department of Astronomy, University of Illinois at Urbana-Champaign, (2), National Center for Supercomputing Applications

TL;DR
This paper presents a novel machine learning approach using nearest neighbor algorithms to generate probabilistic photometric redshifts for galaxies and quasars, significantly reducing catastrophic errors especially for quasars by incorporating UV data from GALEX.
Contribution
Introduces a simple yet effective NN-based method to produce full redshift PDFs, improving accuracy and reducing catastrophic failures in photometric redshift estimation.
Findings
Achieves sigma = 0.0207 for main galaxies, sigma = 0.0243 for luminous red galaxies, and sigma = 0.343 for quasars.
Reduces quasar redshift errors from 0.343 to 0.117 for single-peak PDFs.
Incorporating UV data from GALEX significantly improves quasar redshift accuracy.
Abstract
We apply machine learning in the form of a nearest neighbor instance-based algorithm (NN) to generate full photometric redshift probability density functions (PDFs) for objects in the Fifth Data Release of the Sloan Digital Sky Survey (SDSS DR5). We use a conceptually simple but novel application of NN to generate the PDFs - perturbing the object colors by their measurement error - and using the resulting instances of nearest neighbor distributions to generate numerous individual redshifts. When the redshifts are compared to existing SDSS spectroscopic data, we find that the mean value of each PDF has a dispersion between the photometric and spectroscopic redshift consistent with other machine learning techniques, being sigma = 0.0207 +/- 0.0001 for main sample galaxies to r < 17.77 mag, sigma = 0.0243 +/- 0.0002 for luminous red galaxies to r < ~19.2 mag, and sigma = 0.343 +/- 0.005…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
