Harvesting the Ly\alpha\ forest with convolutional neural networks
Ting-Yun Cheng, Ryan Cooke, Gwen Rudie

TL;DR
This paper introduces a convolutional neural network (CNN) that efficiently identifies and characterizes low HI column density Lyα absorption systems in quasar spectra, significantly improving analysis speed and accuracy over traditional methods.
Contribution
The study presents a novel CNN-based algorithm trained on simulated spectra to accurately detect and predict properties of Lyα forest absorbers, demonstrating high performance on real observational data.
Findings
78% of CNN-identified systems match manual catalogues
Stable performance for spectra with S/N ≥ 10
Predictions are fast and accurate within specified parameter ranges
Abstract
We develop a machine learning based algorithm using a convolutional neural network (CNN) to identify low HI column density Ly absorption systems () in the Ly forest, and predict their physical properties, such as their HI column density (), redshift (), and Doppler width (). Our CNN models are trained using simulated spectra (S/N ), and we test their performance on high quality spectra of quasars at redshift observed with the High Resolution Echelle Spectrometer on the Keck I telescope. We find that of the systems identified by our algorithm are listed in the manual Voigt profile fitting catalogue. We demonstrate that the performance of our CNN is stable and consistent for all simulated and observed spectra with S/N…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
