Machine learning in APOGEE: Unsupervised spectral classification with $K$-means
Rafael Garcia-Dias, Carlos Allende Prieto, Jorge S\'anchez Almeida and, Ignacio Ordov\'as-Pascual

TL;DR
This paper applies $K$-means clustering to a large set of APOGEE stellar spectra to classify stars and explore the effectiveness of unsupervised learning in astronomical spectral analysis.
Contribution
It demonstrates the application of $K$-means to high-resolution stellar spectra, revealing its strengths and limitations in classifying stellar populations.
Findings
Successfully separates bulge and halo populations
Distinguishes different stellar evolutionary stages
Highlights limitations of $K$-means in flux space
Abstract
The data volume generated by astronomical surveys is growing rapidly. Traditional analysis techniques in spectroscopy either demand intensive human interaction or are computationally expensive. In this scenario, machine learning, and unsupervised clustering algorithms in particular offer interesting alternatives. The Apache Point Observatory Galactic Evolution Experiment (APOGEE) offers a vast data set of near-infrared stellar spectra which is perfect for testing such alternatives. Apply an unsupervised classification scheme based on -means to the massive APOGEE data set. Explore whether the data are amenable to classification into discrete classes. We apply the -means algorithm to 153,847 high resolution spectra (). We discuss the main virtues and weaknesses of the algorithm, as well as our choice of parameters. We show that a classification based on normalised…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsStellar, planetary, and galactic studies · Astronomy and Astrophysical Research · Spectroscopy and Chemometric Analyses
