Homogeneous Stellar Parameters from Heterogeneous Spectra with Deep Learning
Jeff Shen, Joshua S. Speagle, Shirley Ho

TL;DR
This paper introduces a Transformer-based deep learning framework that provides consistent stellar parameters and chemical abundances across multiple spectroscopic surveys, enabling large-scale Galactic archaeology.
Contribution
The authors develop a unified, end-to-end deep learning model that simultaneously analyzes spectra from diverse surveys without post-hoc calibration, ensuring homogeneous stellar labels.
Findings
Achieves high precision in stellar parameters on APOGEE spectra
Demonstrates consistency of labels across different surveys
Provides a publicly available homogeneous stellar catalog
Abstract
Large-scale spectroscopic surveys have collectively observed millions of stars across the Milky Way, but each derives stellar labels using independent pipelines with distinct modelling assumptions, introducing systematic offsets that obscure signals in chemical space and hinder large-scale Galactic archaeology. We present a unified deep-learning framework that delivers atmospheric parameters, chemical abundances for 20 elements, distances, and ages -- all on a single, self-consistent scale -- for an arbitrary number of spectroscopic surveys simultaneously. Our approach uses a Transformer model that ingests spectra of arbitrary wavelength range and resolution, trained end-to-end as a single model across all surveys, eliminating the need for post-hoc recalibration. We apply this framework to spectra from APOGEE DR17, GALAH DR3, DESI DR1, and RVS DR3, spanning resolutions…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
