# Development of an interpretable machine learning model for early prediction of aortic stiffness risk in health examination populations

**Authors:** Zhensen Zhou, Xiao Sun, Hongmei Qian, Zuchang Ma, Fuying Kong, Ting Li, Wanqiu Zhang, Yuxia Li

PMC · DOI: 10.3389/fcvm.2025.1730409 · Frontiers in Cardiovascular Medicine · 2026-01-07

## TL;DR

This study created a machine learning model to predict aortic stiffness risk using routine health data, offering a practical alternative to complex measurements.

## Contribution

The novel contribution is an interpretable XGBoost model with a web-based calculator for aortic stiffness risk prediction using accessible health indicators.

## Key findings

- The XGBoost model achieved an AUC of 0.903 on the testing set and 0.979 in cross-validation.
- External validation showed an AUC of 1.000, indicating strong generalizability.
- SHAP analysis provided interpretable insights into feature contributions for model predictions.

## Abstract

Carotid-femoral pulse wave velocity (cfPWV) is the gold standard for assessing aortic stiffness, but its complexity, time consumption, and privacy concerns limit its application in routine health examinations. This study aimed to develop an interpretable machine learning model based on readily accessible indicators, providing an alternative screening tool for institutions unable to perform cfPWV measurements.

A total of 261 participants were enrolled as the development cohort and randomly divided into training and testing sets (7:3 ratio), with 101 additional participants as an independent external validation set. Aortic stiffness was defined as cfPWV ≥10 m/s. Features were selected by combining univariate logistic regression, recursive feature elimination with random forest (RFE-RF), and least absolute shrinkage and selection operator (LASSO) methods. Six machine learning models were constructed: logistic regression (LR), random forest (RF), extreme gradient boosting (XGBoost), support vector machine (SVM), Naive Bayes (NB), and K-nearest neighbours (KNN). Internal validation was performed using 5-fold cross-validation, and performance was evaluated using area under the receiver operating characteristic curve (AUC), calibration curves, and decision curve analysis (DCA). The optimal model was validated on the external set, and interpretability was analyzed using SHapley Additive exPlanations (SHAP). Subsequently, the model was deployed as an interactive, web-based application utilizing the Streamlit framework in Python.

Seven key variables were selected: Age, body mass index (BMI), mean arterial pressure (MAP), fasting blood glucose (FBG), high-density lipoprotein cholesterol (HDL-C), glomerular filtration rate (GFR), and aspartate aminotransferase (AST). The XGBoost model achieved the best performance with an AUC of 0.903 (95% CI: 0.830–0.975) on the testing set and a mean AUC of 0.979 (95% CI: 0.960–0.997) in 5-fold cross-validation on the training set. External validation demonstrated robust generalizability (AUC = 1.000). DCA indicated a favourable clinical net benefit, and SHAP analysis quantified feature contributions.

This study developed a high-performance XGBoost model based on routine health examination indicators and further implemented an accessible web-based calculator to support clinical decision-making in settings without direct access to cfPWV measurement. The calculator is available at: https://lgezijo5wyivqrnt2zrkcm.streamlit.app/.

## Full-text entities

- **Genes:** SLC17A5 (solute carrier family 17 member 5) [NCBI Gene 26503] {aka AST, ISSD, NSD, SD, SIALIN, SIASD}
- **Diseases:** stiffness (MESH:C566112)
- **Chemicals:** glucose (MESH:D005947)

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/PMC12819839/full.md

## Figures

9 figures with captions in the complete paper: https://tomesphere.com/paper/PMC12819839/full.md

## References

42 references — full list in the complete paper: https://tomesphere.com/paper/PMC12819839/full.md

---
Source: https://tomesphere.com/paper/PMC12819839