J-PAS: Semi-Supervised Sim-to-Obs Transfer for Robust Star--Galaxy--Quasar Classification
Daniel L\'opez-Cano, L. Raul Abramo, L. Nakazono, I. P\'erez-R\`afols, G. Mart\'inez-Solaeche, J. Chaves-Montero, Matthew M. Pieri, Jailson Alcaniz, Narciso Benitez, Silvia Bonoli, Saulo Carneiro, Javier Cenarro, David Crist\'obal-Hornillos, Simone Daflon, Renato Dupke

TL;DR
This paper presents a semi-supervised domain adaptation method for classifying stars, galaxies, and quasars in astrophysical data, improving accuracy by leveraging simulated data and limited real observations.
Contribution
It introduces a semi-supervised transfer learning pipeline that enhances astrophysical classification by combining simulated and real data with minimal labeled target data.
Findings
Achieves higher macro-F1 score (0.82) than baselines (0.79) and mocks-only model (0.73).
Significantly improves high-redshift quasar classification (F1=0.66).
Provides better candidate lists for spectroscopic surveys.
Abstract
Modern studies in astrophysics and cosmology increasingly rely on simulations and cross-survey analyses, yet differences in data generation, instrumentation, calibration, and unmodeled physics introduce distribution mismatches between datasets (``domain shift''). In machine-learning pipelines, this occurs when the joint distribution of inputs and labels differs between the training (source) and application (target) domains, causing source-trained models to underperform on the target. Transfer learning and domain adaptation provide principled ways to mitigate this effect. We study a concrete simulation-to-observation case: semi-supervised domain adaptation (SSDA) to transfer a four-class spectral classifier -- high-redshift quasars, low-redshift quasars, galaxies, and stars -- from J-PAS mock catalogs based on DESI spectra to real J-PAS observations. Our pipeline pretrains on abundant…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGalaxies: Formation, Evolution, Phenomena · Domain Adaptation and Few-Shot Learning · Machine Learning and Data Classification
