Inferring stellar metallicity and elemental abundances from kinematic and spectroscopic data using machine learning -- Implications for exoplanet host stars

V. Adibekyan; B.M.T.B. Soares; S.G.Sousa; N.C. Santos; E. Delgado-Mena; I. Minchev; R. Chertovskih; Zh. Martirosyan; G. Israelian; and A.A. Hakobyan

arXiv:2605.18131·astro-ph.EP·May 19, 2026

Inferring stellar metallicity and elemental abundances from kinematic and spectroscopic data using machine learning -- Implications for exoplanet host stars

V. Adibekyan, B.M.T.B. Soares, S.G.Sousa, N.C. Santos, E. Delgado-Mena, I. Minchev, R. Chertovskih, Zh. Martirosyan, G. Israelian, and A.A. Hakobyan

PDF

TL;DR

This study employs machine learning to infer stellar elemental abundances and metallicity from spectroscopic and kinematic data, improving predictions for FGK stars and exploring implications for exoplanet host stars.

Contribution

It introduces optimized machine learning models trained on APOGEE data to predict stellar abundances, incorporating kinematic information and deriving empirical relations.

Findings

01

Kinematic data alone limits metallicity prediction accuracy (RMSE ~0.20 dex).

02

Combining [Fe/H] with kinematics improves abundance predictions significantly.

03

Mg abundance is the most influential predictor for C and O from other elemental data.

Abstract

(abridged) Elemental abundances of FGK stars can be derived routinely from high-resolution optical spectra, but this remains considerably more difficult for cooler stars. Machine-learning methods offer a practical route to infer otherwise inaccessible abundances from more widely available stellar data. We use a large APOGEE DR17 sample of red giant stars as the main training set and an independent HARPS sample of nearby FGK dwarfs for external validation. We benchmark several machine-learning regressors, optimise the strongest models, and analyse feature importance using gain-based metrics, permutation importance, single-feature models, and SHAP values. We also explored the prediction of C and O from Mg, Si, and [Fe/H], and derived simple empirical relations between selected abundance ratios (Fe/Si, Mg/Si, C/O, and Fe/O) and metallicity. Kinematic information alone recovers only a…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.