TL;DR
This paper presents a scalable Gaussian process classification method using PCA embeddings for efficient star-galaxy separation in large-scale survey images, outperforming some existing algorithms and providing uncertainty quantification.
Contribution
The authors develop a linear-scaling hyperparameter training method for Gaussian processes applied to PCA-embedded image data, enabling large-scale optical image classification.
Findings
GP models perform favorably compared to CNNs and morphology discriminators.
The method provides reliable uncertainty estimates for classifications.
Scalable GP classification effectively processes large survey datasets.
Abstract
We introduce a novel method for discerning optical telescope images of stars from those of galaxies using Gaussian processes (GPs). Although applications of GPs often struggle in high-dimensional data modalities such as optical image classification, we show that a low-dimensional embedding of images into a metric space defined by the principal components of the data suffices to produce high-quality predictions from real large-scale survey data. We develop a novel method of GP classification hyperparameter training that scales approximately linearly in the number of image observations, which allows for application of GP models to large-size Hyper Suprime-Cam (HSC) Subaru Strategic Program data. In our experiments we evaluate the performance of a principal component analysis (PCA) embedded GP predictive model against other machine learning algorithms including a convolutional neural…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
