ATCAT: Astronomical Timeseries CAusal Transformer
Zora Tung

TL;DR
This paper introduces ATCAT, a transformer-based model that achieves state-of-the-art performance in classifying astronomical light curves, demonstrating label efficiency, calibration correction, and high-speed inference suitable for large-scale LSST data analysis.
Contribution
The paper presents a novel transformer model for astronomical time-series classification that outperforms previous methods, with improved efficiency, label efficiency, and calibration correction.
Findings
Achieves 71.8% F1 on LC-only classification and 89.8% on LC+metadata.
Remains effective with only 10% labeled data, achieving 67.4% F1.
Runs inference at ~33000 light curves per second on a consumer GPU.
Abstract
The Legacy Survey of Space and Time (LSST) at the Vera C. Rubin Observatory will capture light curves (LCs) for 10 billion sources and produce millions of transient candidates per night, necessitating scalable, accurate, and efficient classification. To prepare the community for this scale of data, the Extended LSST Astronomical Time-Series Classification Challenge (ELAsTiCC) sought to simulate a diversity of LSST-like time-domain events. Using a small transformer-based model and refined light curve encoding logic, we present a new state of the art classification performance on ELAsTiCC, with 71.8% F1 on LC-only classifications, and 89.8% F1 on LC+metadata classifications. Previous state of the art was 65.5% F1 for LC-only, and for LC+metadata, 84% F1 with a different setup and 83.5% F1 with a directly comparable setup. Our model outperforms previous state-of-the-art models for…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsDiverse Scientific and Economic Studies · Medical Coding and Health Information · Research Data Management Practices
