Star-Galaxy Classification in Multi-Band Optical Imaging
Ross Fadely, David W. Hogg, and Beth Willman

TL;DR
This paper compares photometric star-galaxy classification methods for deep optical surveys, finding Hierarchical Bayesian template fitting and well-trained SVMs outperform traditional approaches, though perfect separation remains challenging.
Contribution
It introduces a Hierarchical Bayesian template fitting method and compares it with SVMs, highlighting the importance of training data quality for star-galaxy classification.
Findings
HB outperforms ML with ~80% completeness and 60-90% purity.
Well-trained SVMs outperform template-fitting methods.
No algorithm achieves perfect classification, especially for metal-poor stars.
Abstract
Ground-based optical surveys such as PanSTARRS, DES, and LSST, will produce large catalogs to limiting magnitudes of r > 24. Star-galaxy separation poses a major challenge to such surveys because galaxies---even very compact galaxies---outnumber halo stars at these depths. We investigate photometric classification techniques on stars and galaxies with intrinsic FWHM < 0.2 arcsec. We consider unsupervised spectral energy distribution template fitting and supervised, data-driven Support Vector Machines (SVM). For template fitting, we use a Maximum Likelihood (ML) method and a new Hierarchical Bayesian (HB) method, which learns the prior distribution of template probabilities from the data. SVM requires training data to classify unknown sources; ML and HB don't. We consider i.) a best-case scenario (SVM_best) where the training data is (unrealistically) a random sampling of the data in…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
