Identification, mass and age of primary red clump stars from spectral features derived with the LAMOST DR7
Xu-Jiang He, A-Li Luo, and Yu-Qin Chen

TL;DR
This study develops a machine learning approach using XGBoost to accurately identify primary red clump stars, estimate their ages and masses, and analyze spectral features, achieving high purity and reliable distance measurements from LAMOST DR7 data.
Contribution
The paper introduces a novel application of XGBoost and SHAP for distinguishing RC and RGB stars, estimating stellar parameters, and validating feature importance with spectral data.
Findings
Achieved over 90% purity in identifying primary RC stars.
Estimated stellar ages and masses with uncertainties of 31% and 13%.
Demonstrated effectiveness at lower spectral resolution (R~250).
Abstract
Although red clump (RC) stars are easy to identify due to their stability of luminosity and color, about 20-50% are actually red giant branch (RGB) stars in the same location on the HR diagram. In this paper, a sample of 210,504 spectra for 184 318 primary RC (PRC) stars from the LAMOST DR7 is identified, which has a purity of higher than 90 percent. The RC and the RGB stars are successfully distinguished through LAMOST spectra(R~1800 and SNR>10) by adopting the XGBoost ensemble learning algorithm, and the secondary RC stars are also removed. The SHapley Additive exPlanations (SHAP) value is used to explain the top features that the XGBoost model selected. The features are around Fe5270, MgH & MgIb, Fe4957, Fe4207, Cr5208, and CN, which can successfully distinguish RGB and RC stars. The XGBoost is also used to estimate the ages and masses of PRC stars by training their spectra with…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
