Automatic unsupervised classification of all SDSS/DR7 galaxy spectra
J. Sanchez Almeida (1, 2), J. A. L. Aguerri (1, 2), C., Munoz-Tunon (1, 2), A. de Vicente (1, 2) ((1) Instituto de Astrofisica, de Canarias, La Laguna, Tenerife, Spain, (2) Departamento de Astrofisica,, Universidad de La Laguna, Tenerife, Spain)

TL;DR
This paper presents an unsupervised, automated classification of SDSS galaxy spectra into 17 major classes using k-means clustering, revealing correlations with galaxy properties and enabling various astrophysical applications.
Contribution
The study introduces the ASK classification, a novel unsupervised method that classifies galaxy spectra into meaningful classes without prior labels, based solely on spectral features.
Findings
99% of galaxies classified into 17 major classes
Classes correspond to galaxy colors and activity types
Classification useful for physical property estimation and target selection
Abstract
Using the 'k-means' cluster analysis algorithm, we carry out an unsupervised classification of all galaxy spectra in the seventh and final Sloan Digital Sky Survey data release (SDSS/DR7). Except for the shift to restframe wavelengths, and the normalization to the g-band flux, no manipulation is applied to the original spectra. The algorithm guarantees that galaxies with similar spectra belong to the same class. We find that 99 % of the galaxies can be assigned to only 17 major classes, with 11 additional minor classes including the remaining 1%. The classification is not unique since many galaxies appear in between classes, however, our rendering of the algorithm overcomes this weakness with a tool to identify borderline galaxies. Each class is characterized by a template spectrum, which is the average of all the spectra of the galaxies in the class. These low noise template spectra…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
