From Simulations to Surveys: Domain Adaptation for Galaxy Observations
Kaley Brauer, Aditya Prasad Dash, Meet J. Vyas, Ahmed Salim, Stiven Briand Massala

TL;DR
This paper introduces a domain adaptation pipeline that improves the transfer of galaxy morphology classification models trained on simulated data to real survey data by using advanced loss functions and top-k matching techniques.
Contribution
It presents a novel domain adaptation approach combining feature-level domain loss with top-k soft matching to enhance model transfer from simulations to real galaxy observations.
Findings
Target accuracy improved from ~46% to ~87% with adaptation.
Domain AUC near 0.5 indicates effective latent-space mixing.
Combines multiple loss functions for better domain alignment.
Abstract
Large photometric surveys will image billions of galaxies, but we currently lack quick, reliable automated ways to infer their physical properties like morphology, stellar mass, and star formation rates. Simulations provide galaxy images with ground-truth physical labels, but domain shifts in PSF, noise, backgrounds, selection, and label priors degrade transfer to real surveys. We present a preliminary domain adaptation pipeline that trains on simulated TNG50 galaxies and evaluates on real SDSS galaxies with morphology labels (elliptical/spiral/irregular). We train three backbones (CNN, -steerable CNN, ResNet-18) with focal loss and effective-number class weighting, and a feature-level domain loss built from GeomLoss (entropic Sinkhorn OT, energy distance, Gaussian MMD, and related metrics). We show that a combination of these losses with an OT-based "top_ soft matching"…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsDomain Adaptation and Few-Shot Learning · Galaxies: Formation, Evolution, Phenomena · Gaussian Processes and Bayesian Inference
