Pre-training vision models for the classification of alerts from wide-field time-domain surveys
Nabeel Rehemtulla, Adam A. Miller, Mike Walmsley, Ved G. Shah, Theophile Jegou du Laz, Michael W. Coughlin, Argyro Sasli, Joshua Bloom, Christoffer Fremling, Matthew J. Graham, Steven L. Groom, David Hale, Ashish A. Mahabal, Daniel A. Perley, Josiah Purdum, Ben Rusholme

TL;DR
This paper demonstrates that pre-trained, standardized CNN architectures, especially those pre-trained on galaxy images, outperform custom models in classifying alerts from wide-field time-domain surveys, offering improved efficiency and accuracy.
Contribution
It shows that adopting pre-trained, standardized vision models from computer vision significantly enhances alert classification in astronomy, surpassing custom CNNs in performance and efficiency.
Findings
Pre-trained models match or outperform custom CNNs.
Galaxy Zoo pre-training yields better results than ImageNet.
Standard architectures are more efficient than custom models.
Abstract
Modern wide-field time-domain surveys facilitate the study of transient, variable and moving phenomena by conducting image differencing and relaying alerts to their communities. Machine learning tools have been used on data from these surveys and their precursors for more than a decade, and convolutional neural networks (CNNs), which make predictions directly from input images, saw particularly broad adoption through the 2010s. Since then, continually rapid advances in computer vision have transformed the standard practices around using such models. It is now commonplace to use standardized architectures pre-trained on large corpora of everyday images (e.g., ImageNet). In contrast, time-domain astronomy studies still typically design custom CNN architectures and train them from scratch. Here, we explore the effects of adopting various pre-training regimens and standardized model…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
- 🤗nabeelr/BTSbot-convnext-pico-galaxyzoomodel· 3 dl3 dl
- 🤗nabeelr/BTSbot-maxvit-tiny-galaxyzoomodel· 2 dl2 dl
- 🤗nabeelr/BTSbot-maxvit-tiny-in1kmodel· 3 dl3 dl
- 🤗nabeelr/BTSbot-maxvit-tiny-randinitmodel· 2 dl2 dl
- 🤗nabeelr/BTSbot-convnext-pico-in1kmodel· 2 dl2 dl
- 🤗nabeelr/BTSbot-convnext-pico-randinitmodel· 2 dl2 dl
- 🤗nabeelr/BTSbot-convnext-pico-galaxyzoo-metadatamodel· 3 dl3 dl
- 🤗nabeelr/BTSbot-convnext-pico-in1k-metadatamodel· 1 dl1 dl
- 🤗nabeelr/BTSbot-convnext-pico-randinit-metadatamodel· 2 dl2 dl
- 🤗nabeelr/BTSbot-maxvit-tiny-galaxyzoo-metadatamodel· 2 dl2 dl
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdversarial Robustness in Machine Learning · Radio Astronomy Observations and Technology · Remote-Sensing Image Classification
