Star-Galaxy Classification in Multi-Band Optical Imaging

Ross Fadely; David W. Hogg; and Beth Willman

arXiv:1206.4306·astro-ph.IM·June 5, 2015

Star-Galaxy Classification in Multi-Band Optical Imaging

Ross Fadely, David W. Hogg, and Beth Willman

PDF

TL;DR

This paper compares photometric star-galaxy classification methods for deep optical surveys, finding Hierarchical Bayesian template fitting and well-trained SVMs outperform traditional approaches, though perfect separation remains challenging.

Contribution

It introduces a Hierarchical Bayesian template fitting method and compares it with SVMs, highlighting the importance of training data quality for star-galaxy classification.

Findings

01

HB outperforms ML with ~80% completeness and 60-90% purity.

02

Well-trained SVMs outperform template-fitting methods.

03

No algorithm achieves perfect classification, especially for metal-poor stars.

Abstract

Ground-based optical surveys such as PanSTARRS, DES, and LSST, will produce large catalogs to limiting magnitudes of r > 24. Star-galaxy separation poses a major challenge to such surveys because galaxies---even very compact galaxies---outnumber halo stars at these depths. We investigate photometric classification techniques on stars and galaxies with intrinsic FWHM < 0.2 arcsec. We consider unsupervised spectral energy distribution template fitting and supervised, data-driven Support Vector Machines (SVM). For template fitting, we use a Maximum Likelihood (ML) method and a new Hierarchical Bayesian (HB) method, which learns the prior distribution of template probabilities from the data. SVM requires training data to classify unknown sources; ML and HB don't. We consider i.) a best-case scenario (SVM_best) where the training data is (unrealistically) a random sampling of the data in…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.