Estimating Galaxy Redshift in Radio-Selected Datasets using Machine Learning
Kieran J. Luken, Ray P. Norris, Laurence A. F. Park, X. Rosalind Wang,, Miroslav D. Filipovic

TL;DR
This study evaluates machine learning methods, particularly kNN and Random Forest, for estimating galaxy redshifts from radio-selected datasets, finding effective classification of high-redshift galaxies despite challenges in regression accuracy.
Contribution
It compares various distance metrics and modes of machine learning algorithms for redshift estimation, highlighting the effectiveness of classification in identifying high-redshift galaxies.
Findings
kNN with Mahalanobis distance performs best for low redshift regression.
Classification mode effectively identifies high-redshift galaxies.
Field variation does not significantly affect redshift predictions.
Abstract
All-sky radio surveys are set to revolutionise the field with new discoveries. However, the vast majority of the tens of millions of radio galaxies won't have the spectroscopic redshift measurements required for a large number of science cases. Here, we evaluate techniques for estimating redshifts of galaxies from a radio-selected survey. Using a radio-selected sample with broadband photometry at infrared and optical wavelengths, we test the k-Nearest Neighbours (kNN) and Random Forest machine learning algorithms, testing them both in their regression and classification modes. Further, we test different distance metrics used by the kNN algorithm, including the standard Euclidean distance, the Mahalanobis distance and a learned distance metric for both the regression mode (the Metric Learning for Kernel Regression metric) and the classification mode (the Large Margin Nearest Neighbour…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Statistical Methods and Models · Galaxies: Formation, Evolution, Phenomena · Advanced Measurement and Metrology Techniques
