Pre-training vision models for the classification of alerts from wide-field time-domain surveys

Nabeel Rehemtulla; Adam A. Miller; Mike Walmsley; Ved G. Shah; Theophile Jegou du Laz; Michael W. Coughlin; Argyro Sasli; Joshua Bloom; Christoffer Fremling; Matthew J. Graham; Steven L. Groom; David Hale; Ashish A. Mahabal; Daniel A. Perley; Josiah Purdum; Ben Rusholme; Jesper Sollerman; Mansi M. Kasliwal

arXiv:2512.11957·astro-ph.IM·March 12, 2026

Open Access · 10 Models

TL;DR

This paper shows that standardized CNN architectures with pre-training, especially pre-training on galaxy images, match or outperform custom models at classifying alerts from wide-field time-domain surveys, and do so more efficiently.

Contribution

The paper shows that adopting standardized, pre-trained architectures from computer vision improves alert classification in time-domain astronomy, matching or surpassing custom CNNs in performance while being more efficient.

Findings

01 Pre-trained models match or outperform custom CNNs.

02 Galaxy Zoo pre-training yields better results than ImageNet.

03 Standard architectures are more efficient than custom models.

Abstract

Modern wide-field time-domain surveys facilitate the study of transient, variable and moving phenomena by conducting image differencing and relaying alerts to their communities. Machine learning tools have been used on data from these surveys and their precursors for more than a decade, and convolutional neural networks (CNNs), which make predictions directly from input images, saw particularly broad adoption through the 2010s. Since then, continually rapid advances in computer vision have transformed the standard practices around using such models. It is now commonplace to use standardized architectures pre-trained on large corpora of everyday images (e.g., ImageNet). In contrast, time-domain astronomy studies still typically design custom CNN architectures and train them from scratch. Here, we explore the effects of adopting various pre-training regimens and standardized model…

Taxonomy

Topics: Adversarial Robustness in Machine Learning · Radio Astronomy Observations and Technology · Remote-Sensing Image Classification