Astronomaly Protege: Discovery Through Human-Machine Collaboration
Michelle Lochner, Lawrence Rudnick

TL;DR
Astronomaly Protege is a human-machine collaborative framework that efficiently identifies scientifically interesting and diverse anomalies in large astronomical datasets using active learning and self-supervised deep learning.
Contribution
It introduces Protege, an extension of Astronomaly, that leverages self-supervised learning and active anomaly detection to recommend diverse, scientifically relevant sources with minimal human labeling.
Findings
Protege effectively recommends diverse, interesting sources with minimal training.
It can identify sources that even the authors missed.
The framework enhances discovery potential in large astronomical datasets.
Abstract
Modern telescopes generate catalogs of millions of objects with the potential for new scientific discoveries, but this is beyond what can be examined visually. Here we introduce Astronomaly: Protege, an extension of the general purpose machine learning-based active anomaly detection framework Astronomaly. Protege is designed to provide well-selected recommendations for visual inspection, based on a small amount of optimized human labeling. The resulting sample contains rare or unusual sources which are simultaneously as diverse as the human trainer chooses, and of scientific interest to them. We train Protege on images from the MeerKAT Galaxy Cluster Legacy Survey, leveraging the self-supervised deep learning algorithm Bootstrap Your Own Latent to find a low-dimensional representation of the radio galaxy cutouts. By operating in this feature space, Protege is able to recommend…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsHistory and Developments in Astronomy · Research Data Management Practices
