Transferring spectroscopic stellar labels to 217 million Gaia DR3 XP stars with SHBoost
A. Khalatyan, F. Anders, C. Chiappini, A. B. A. Queiroz, S. Nepal, M., dal Ponte, C. Jordi, G. Guiglion, M. Valentini, G. Torralba Elipe, M., Steinmetz, M. Pantaleoni-Gonz\'alez, S. Malhotra, \'O. Jim\'enez-Arranz, H., Enke, L. Casamiquela, J. Ard\`evol

TL;DR
This study demonstrates that machine learning regression using gradient-boosted trees can accurately derive stellar parameters and extinctions from Gaia DR3 XP spectra for over 217 million stars, without ground-based spectra.
Contribution
It introduces a novel application of gradient-boosted random-forest regression to extract stellar parameters from Gaia XP spectra at an unprecedented scale, with reliable uncertainties and interpretability.
Findings
Achieved median uncertainties of 0.20 mag in extinction and 0.01 dex in temperature.
Predicted stellar parameters with competitive accuracy compared to classical methods.
Extended the stellar parameter estimation to over 217 million Gaia stars.
Abstract
In this paper, we explore the feasibility of using machine learning regression as a method of extracting basic stellar parameters and line-of-sight extinctions from spectro-photometric data. We built a stable gradient-boosted random-forest regressor (xgboost), trained on spectroscopic data, capable of producing output parameters with reliable uncertainties from Gaia DR3 data (most notably the low-resolution XP spectra), without ground-based spectroscopic observations. Using Shapley additive explanations, we interpret how the predictions for each star are influenced by each data feature. For the training and testing of the network, we used high-quality parameters obtained from the StarHorse code for a sample of around eight million stars observed by major spectroscopic stellar surveys, complemented by curated samples of hot stars, very metal-poor stars, white dwarfs, and hot sub-dwarfs.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAstronomy and Astrophysical Research · Stellar, planetary, and galactic studies · Gamma-ray bursts and supernovae
