Computer-generated visual morphology catalog of ~3,000,000 SDSS galaxies
Evan Kuminski, Lior Shamir

TL;DR
This paper presents a large, automated catalog of galaxy morphologies for about 3 million SDSS galaxies, achieving high accuracy by leveraging computer analysis validated against Galaxy Zoo data.
Contribution
The authors developed and validated a computer-based method to classify galaxy morphology at an unprecedented scale, producing a publicly accessible catalog with high accuracy.
Findings
Catalog contains ~900,000 spiral galaxies and ~600,000 ellipticals.
Classification accuracy has a ~98% agreement rate with Galaxy Zoo.
The method demonstrates computer analysis can effectively structure large galaxy datasets.
Abstract
We applied computer analysis to classify the broad morphological type of ~3,000,000 SDSS galaxies. The catalog provides for each galaxy the DR8 object ID, right ascension, declination, and the certainty of the automatic classification to spiral or elliptical. The certainty of the classification allows controlling the accuracy of a subset of galaxies by sacrificing some of the least certain classifications. The accuracy of the catalog was tested using galaxies that were classified by the manually annotated Galaxy Zoo catalog. The results show that the catalog contains ~900,000 spiral galaxies and ~600,000 elliptical galaxies with classification certainty that has a statistical agreement rate of ~98% with Galaxy Zoo debiased 'superclean' dataset. That also demonstrates the ability of computers to turn large datasets of galaxy images into structured catalogs of galaxy morphology. The…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
