Intervening or associated? Machine learning classification of redshifted H I 21-cm absorption
S. J. Curran

TL;DR
This study improves machine learning classification of HI 21-cm absorption origins, distinguishing intervening from host galaxy sources, which is crucial for upcoming large-scale radio surveys like the SKA.
Contribution
It tests and compares four machine learning algorithms on an expanded dataset, achieving up to 80% accuracy in classifying the origin of HI 21-cm absorption.
Findings
Logistic regression yields the best classification performance.
Adding new absorbers enhances prediction accuracy.
Classified PKS 1657-298 as associated, consistent with previous redshift estimates.
Abstract
In a previous paper we presented the results of applying machine learning to classify whether an HI 21-cm absorption spectrum arises in a source intervening the sight-line to a more distant radio source or within the host of the radio source itself. This is usually determined from an optical spectrum giving the source redshift. However, not only will this be impractical for the large number of sources expected to be detected with the Square Kilometre Array, but bright optical sources are the most ultra-violet luminous at high redshift and so bias against the detection of cool, neutral gas. Adding another 44, mostly newly detected absorbers, to the previous sample of 92, we test four different machine learning algorithms, again using the line properties (width, depth and number of Gaussian fits) as features. Of these algorithms, three gave a some improvement over the previous sample,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
