Data-driven stellar intrinsic colors and dust reddenings for spectro-photometric data: From the blue-edge method to a machine-learning approach
He Zhao, Shu Wang, Biwei Jiang, Jun Li, Dongwei Fan, Yi Ren, Xiaoxiao, Ma

TL;DR
This paper develops a machine learning model using XGBoost to accurately predict stellar intrinsic colors and dust reddenings from spectro-photometric data, enabling large-scale dust and stellar property studies.
Contribution
It introduces a novel machine learning approach for predicting stellar intrinsic colors and dust reddening, validated on extensive spectro-photometric data, improving accuracy over previous methods.
Findings
Model achieves low prediction errors for intrinsic colors.
Application to millions of stars validates model accuracy.
Reveals large-scale variation in extinction law across the Galaxy.
Abstract
Intrinsic colors (ICs) of stars are essential for the studies on both stellar physics and dust reddening. In this work, we developed an XGBoost model to predict the ICs with the atmospheric parameters , , and . The model was trained and tested for three colors at Gaia and 2MASS bands with 1,040,446 low-reddening sources. The atmospheric parameters were determined by the Gaia DR3 GSP-phot module and were validated by comparing with APOGEE and LAMOST. We further confirmed that the biases in GSP-phot parameters, especially for , do not present a significant impact on the IC prediction. The generalization error of the model estimated by the test set is 0.014 mag for , 0.050 mag for , and 0.040 mag for . The model was applied to a sample containing…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
