Transfer Learning for Transient Classification: From Simulations to Real Data and ZTF to LSST
Rithwik Gupta, Daniel Muthukrishna, Nabeel Rehemtulla, Ved Shah

TL;DR
This paper demonstrates that transfer learning can effectively adapt transient classification models from simulations and other surveys to new data, significantly reducing the need for extensive labeled datasets and enabling rapid deployment for upcoming surveys like LSST.
Contribution
The study shows how transfer learning can be used to adapt existing models to new astronomical surveys, reducing training data requirements and improving early classification performance.
Findings
Transfer learning reduces labeled data needs by 95% for ZTF transients.
Transfer learning achieves 94% of baseline performance for LSST simulations with only 30% of training data.
Reliable early classification for LSST is feasible soon after survey start.
Abstract
Machine learning has become essential for automated classification of astronomical transients, but current approaches face significant limitations: classifiers trained on simulations struggle with real data, models developed for one survey cannot be easily applied to another, and new surveys require prohibitively large amounts of labelled training data. These challenges are particularly pressing as we approach the era of the Vera C. Rubin Observatory's Legacy Survey of Space and Time (LSST), where existing classification models will need to be retrained using LSST observations. We demonstrate that transfer learning can overcome these challenges by repurposing existing models trained on either simulations or data from other surveys. Starting with a model trained on simulated Zwicky Transient Facility (ZTF) light curves, we show that transfer learning reduces the amount of labelled real…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
