Identification of single spectral lines in large spectroscopic surveys using UMLAUT: an Unsupervised Machine Learning Algorithm based on Unbiased Topology
I. Baronchelli, C. M. Scarlata, L. Rodriguez-Mu\~noz, M. Bonato, L., Morselli, M. Vaccari, R. Carraro, L. Barrufet, A. Henry, V. Mehta, G., Rodighiero, A. Baruffolo, M. Bagley, A. Battisti, J. Colbert, Y. S. Dai, M., De Pascale, H. Dickinson, M. Malkan, C. Mancini, M. Rafelski

TL;DR
This paper introduces UMLAUT, an unsupervised machine learning algorithm that accurately identifies single spectral emission lines in large spectroscopic surveys, aiding redshift determination in upcoming space missions.
Contribution
The paper presents a novel unsupervised topology-based algorithm, UMLAUT, capable of identifying single emission lines by integrating multiple source features, enhancing accuracy in spectroscopic analysis.
Findings
UMLAUT correctly identifies 83.2% of real lines in survey data.
Combining UMLAUT with previous supervised methods increases accuracy to 84.4%.
The algorithm is adaptable for future large-scale IR spectroscopic surveys.
Abstract
The identification of an emission line is unambiguous when multiple spectral features are clearly visible in the same spectrum. However, in many cases, only one line is detected, making it difficult to correctly determine the redshift. We developed a freely available unsupervised machine-learning algorithm based on unbiased topology (UMLAUT) that can be used in a very wide variety of contexts, including the identification of single emission lines. To this purpose, the algorithm combines different sources of information, such as the apparent magnitude, size and color of the emitting source, and the equivalent width and wavelength of the detected line. In each specific case, the algorithm automatically identifies the most relevant ones (i.e., those able to minimize the dispersion associated with the output parameter). The outputs can be easily integrated into different algorithms,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsComputational Drug Discovery Methods · Metabolomics and Mass Spectrometry Studies
