Photometric identification of blue horizontal branch stars
Kester W. Smith, Coryn A.L. Bailer-Jones, Rainer J. Klement,, Xiang-Xiang Xue

TL;DR
This paper evaluates machine learning techniques for identifying blue horizontal branch stars from photometric data, comparing their performance and applying the best method to classify a large SDSS dataset.
Contribution
It introduces a support vector machine approach for BHB star classification and provides a comprehensive analysis of method performance and application to large survey data.
Findings
SVM yields lowest contamination at given completeness
No-prior approach maximizes completeness, prior reduces contamination
Identified over 27,000 probable BHB stars in SDSS DR7
Abstract
We investigate the performance of some common machine learning techniques in identifying BHB stars from photometric data. To train the machine learning algorithms, we use previously published spectroscopic identifications of BHB stars from SDSS data. We investigate the performance of three different techniques, namely k nearest neighbour classification, kernel density estimation and a support vector machine (SVM). We discuss the performance of the methods in terms of both completeness and contamination. We discuss the prospect of trading off these values, achieving lower contamination at the expense of lower completeness, by adjusting probability thresholds for the classification. We also discuss the role of prior probabilities in the classification performance, and we assess via simulations the reliability of the dataset used for training. Overall it seems that no-prior gives the best…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
