Automated unsupervised classification of the Sloan Digital Sky Survey stellar spectra using k-means clustering
J. Sanchez Almeida (1,2), C. Allende Prieto (1,2) ((1) Instituto de, Astrofisica de Canarias, La Laguna, Tenerife, Spain, (2) Departamento de, Astrofisica, Universidad de La Laguna, Tenerife, Spain)

TL;DR
This paper demonstrates the use of k-means clustering for automated, unsupervised classification of SDSS stellar spectra, effectively distinguishing stellar types and estimating physical parameters with minimal computational effort.
Contribution
It introduces a novel application of k-means clustering to classify stellar spectra without prior physical models, including analysis of spectra with and without continuum.
Findings
16 classes for spectra with continuum, distinguishing stars by color and type
13 classes for spectra without continuum, better separating stars by metallicity
Classification can estimate physical parameters with small ranges, aiding stellar analysis
Abstract
(Abridged) This paper explores the use of k-means clustering as a tool for automated unsupervised classification of massive stellar spectral catalogs. The classification criteria are defined by the data and the algorithm, with no prior physical framework. We work with a representative set of stellar spectra associated with the SDSS SEGUE and SEGUE-2 programs. We classify the original spectra as well as the spectra with the continuum removed. The second set only contains spectral lines, and it is less dependent on uncertainties of the flux calibration. The classification of the spectra with continuum renders 16 major classes. Roughly speaking, stars are split according to their colors, with enough finesse to distinguish dwarfs from giants of the same effective temperature, but with difficulties to separate stars with different metallicities. Overall, there is no one-to-one correspondence…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
