Automatic identification of outliers in Hubble Space Telescope galaxy images
Lior Shamir

TL;DR
This paper presents an unsupervised machine learning algorithm for automatically detecting outlier galaxy images in Hubble Space Telescope data, significantly reducing dataset size for manual review and identifying rare objects.
Contribution
The paper introduces a training-free, unsupervised outlier detection method applied to astronomical images, enabling efficient identification of rare galaxies without prior labeled data.
Findings
Reduced dataset by two orders of magnitude
Identified 147 potential outlier galaxies
Algorithm effectively narrows down candidates for manual review
Abstract
Rare extragalactic objects can carry substantial information about the past, present, and future universe. Given the size of astronomical databases in the information era it can be assumed that very many outlier galaxies are included in existing and future astronomical databases. However, manual search for these objects is impractical due to the required labor, and therefore the ability to detect such objects largely depends on computer algorithms. This paper describes an unsupervised machine learning algorithm for automatic detection of outlier galaxy images, and its application to several Hubble Space Telescope fields. The algorithm does not require training, and therefore is not dependent on the preparation of clean training sets. The application of the algorithm to a large collection of galaxies detected a variety of outlier galaxy images. The algorithm is not perfect in the sense…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
