Beyond the Hubble Sequence -- Exploring Galaxy Morphology with Unsupervised Machine Learning
Ting-Yun Cheng, Marc Huertas-Company, Christopher J. Conselice,, Alfonso Arag\'on-Salamanca, Brant E. Robertson, and Nesar Ramachandra

TL;DR
This paper introduces an unsupervised machine learning approach combining feature extraction and hierarchical clustering to analyze galaxy morphology, revealing well-separated clusters that correlate with physical properties and challenge traditional visual classifications.
Contribution
The study presents a novel methodology integrating feature learning with clustering, considering multiple thresholds and galaxy orientation, leading to more physically meaningful galaxy classifications.
Findings
27 distinct galaxy clusters identified with shape and structure features
Clusters correlate strongly with physical properties like color-magnitude and mass-size relations
Achieved 87% accuracy in binary classification of galaxy types, matching real distributions
Abstract
We explore unsupervised machine learning for galaxy morphology analyses using a combination of feature extraction with a vector-quantised variational autoencoder (VQ-VAE) and hierarchical clustering (HC). We propose a new methodology that includes: (1) consideration of the clustering performance simultaneously when learning features from images; (2) allowing for various distance thresholds within the HC algorithm; (3) using the galaxy orientation to determine the number of clusters. This setup provides 27 clusters created with this unsupervised learning which we show are well separated based on galaxy shape and structure (e.g., S\'ersic index, concentration, asymmetry, Gini coefficient). These resulting clusters also correlate well with physical properties such as the colour-magnitude diagram, and span the range of scaling-relations such as mass vs. size amongst the different…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
