# Predicting carotid plaques in metabolic dysfunction-associated steatotic liver disease using machine learning and SHAP interpretation

**Authors:** Shu-Mei Zhai, Xiao-Long Wang, Han Zhang, Yu-Qiang Zuo

PMC · DOI: 10.1038/s41598-025-19959-8 · Scientific Reports · 2025-10-15

## TL;DR

This study uses machine learning on routine health check-up data to predict carotid plaques in people with metabolic dysfunction-associated steatotic liver disease (MASLD), helping identify those at higher risk of cardiovascular disease.

## Contribution

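The study develops and validates machine learning models, built from health check-up indicators with LASSO-based feature selection, for predicting carotid plaques in MASLD patients, and uses SHAP to quantify how much each risk factor contributes to the predictions.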

## Key findings

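- Among the five algorithms compared, the support vector machine model showed the best discrimination, goodness-of-fit, and clinical utility in predicting carotid plaques.
- Age, systolic blood pressure, total cholesterol, sex, and fasting plasma glucose were the most important risk factors identified by SHAP analysis.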
- The model can help clinicians target interventions for high-risk MASLD patients.
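
To make the SHAP-based ranking above concrete, the sketch below computes exact Shapley values for a toy three-feature risk score. Everything in it is an assumption for illustration: `toy_risk_score`, its coefficients, the example patient, and the baseline are hypothetical and are not taken from the paper. It also enumerates feature subsets directly against a single baseline, which is the underlying Shapley definition, not the estimators implemented in the SHAP library or the authors' pipeline.

```python
from itertools import combinations
from math import factorial

FEATURES = ["age", "systolic_bp", "total_cholesterol"]


def toy_risk_score(z):
    """Hypothetical risk score; the coefficients are invented for illustration."""
    age, sbp, tc = z
    return 0.03 * age + 0.01 * sbp + 0.2 * tc + 0.0002 * age * sbp


def shapley_values(model, x, baseline):
    """Exact Shapley values of model(x) relative to model(baseline).

    Features in a coalition take their value from x; all other features
    are held at the baseline value.
    """
    n = len(x)

    def value(coalition):
        z = [x[i] if i in coalition else baseline[i] for i in range(n)]
        return model(z)

    phi = [0.0] * n
    for i in range(n):
        others = [j for j in range(n) if j != i]
        for k in range(n):
            # Shapley weight for coalitions of size k that exclude feature i
            weight = factorial(k) * factorial(n - k - 1) / factorial(n)
            for subset in combinations(others, k):
                s = set(subset)
                phi[i] += weight * (value(s | {i}) - value(s))
    return phi


# Hypothetical patient and reference values (not from the study)
x = [62, 148, 5.9]
baseline = [45, 120, 4.8]

phi = shapley_values(toy_risk_score, x, baseline)
for name, contribution in zip(FEATURES, phi):
    print(f"{name}: {contribution:+.4f}")
```

The contributions sum to the difference between the patient's score and the baseline score, which is the additivity property that lets SHAP attribute a prediction to individual risk factors. Exact enumeration grows exponentially with the number of features, so it is only practical for small illustrations like this one.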

## Abstract

Cardiovascular disease (CVD) remains the most common cause of death worldwide. Carotid plaque is an indicator of subclinical CVDs. Metabolic dysfunction-associated steatotic liver disease (MASLD) is a risk factor for atherosclerotic CVDs. We aimed to develop and validate a predictive model for carotid plaque occurrence in annual health check-up populations by integrating health check-up indicators with machine learning (ML) algorithms and LASSO-based feature selection, and to leverage advanced interpretability frameworks to elucidate the contribution of individual risk factors. In this retrospective cohort study, we enrolled 4,973 MASLD patients, among whom 1,178 were diagnosed with carotid plaques using carotid ultrasound. Collected baseline data included demographic indicators, clinical histories, blood biochemical parameters, and liver function test indicators. A predictive model for carotid plaques was developed and validated using five ML algorithms. Model performance was evaluated based on the area under the curve, sensitivity, specificity, accuracy, and F1 score. For model interpretability, we adopted the Shapley Additive Explanations (SHAP) framework to quantify the contribution of individual features to the prediction outcomes. Among the five ML models, the support vector machine model demonstrated superior discriminative capability, higher goodness-of-fit, and greater clinical utility than the other models. Moreover, age, systolic blood pressure, total cholesterol, sex, and fasting plasma glucose were the most important risk factors associated with carotid plaques in the MASLD population. This study demonstrated the feasibility of constructing a predictive model for carotid plaques in MASLD populations using health check-up indicators combined with ML algorithms. The application of SHAP methods enhanced model interpretability by quantifying the contribution of individual risk factors to prediction outcomes, enabling clinicians to identify high-risk MASLD patients prone to carotid plaque development so that they can adjust interventions accordingly.
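
The evaluation metrics named in the abstract (area under the curve, sensitivity, specificity, accuracy, and F1 score) can be sketched in a few lines of plain Python. This is a minimal illustration under stated assumptions, not the authors' evaluation code: the labels, predicted scores, and the 0.5 decision threshold below are made up, and the function names are hypothetical.

```python
def classification_metrics(y_true, y_pred):
    """Sensitivity, specificity, accuracy, and F1 from binary labels (1 = plaque)."""
    tp = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 1)
    tn = sum(1 for t, p in zip(y_true, y_pred) if t == 0 and p == 0)
    fp = sum(1 for t, p in zip(y_true, y_pred) if t == 0 and p == 1)
    fn = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 0)
    return {
        "sensitivity": tp / (tp + fn) if (tp + fn) else 0.0,
        "specificity": tn / (tn + fp) if (tn + fp) else 0.0,
        "accuracy": (tp + tn) / len(y_true),
        "f1": 2 * tp / (2 * tp + fp + fn) if (2 * tp + fp + fn) else 0.0,
    }


def roc_auc(y_true, scores):
    """AUC as the probability that a positive case outscores a negative one.

    Ties count as half a win. Requires at least one positive and one negative.
    """
    pos = [s for t, s in zip(y_true, scores) if t == 1]
    neg = [s for t, s in zip(y_true, scores) if t == 0]
    wins = sum(
        1.0 if p > n else 0.5 if p == n else 0.0
        for p in pos
        for n in neg
    )
    return wins / (len(pos) * len(neg))


# Made-up labels and model scores for eight hypothetical patients
y_true = [1, 1, 1, 0, 0, 0, 0, 0]
scores = [0.9, 0.7, 0.4, 0.6, 0.3, 0.2, 0.2, 0.1]
y_pred = [1 if s >= 0.5 else 0 for s in scores]  # assumed threshold of 0.5

metrics = classification_metrics(y_true, y_pred)
auc = roc_auc(y_true, scores)
print(metrics, auc)
```

Sensitivity, specificity, accuracy, and F1 depend on the chosen threshold, whereas the AUC summarizes ranking quality across all thresholds, which is why studies of this kind typically report both.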

## Linked entities

- **Diseases:** cardiovascular disease (MONDO:0004995), metabolic dysfunction-associated steatotic liver disease (MONDO:0013209)

## Full-text entities

- **Diseases:** death (MESH:D003643), MASLD (MESH:D008107), CVD (MESH:D002318), carotid plaques (MESH:D016893), atherosclerotic CVDs (MESH:D050197)
- **Chemicals:** cholesterol (MESH:D002784), glucose (MESH:D005947)
- **Species:** Homo sapiens (human, species) [taxon 9606]

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/PMC12528719/full.md

## Figures

3 figures with captions in the complete paper: https://tomesphere.com/paper/PMC12528719/full.md

## References

2 references — full list in the complete paper: https://tomesphere.com/paper/PMC12528719/full.md

---
Source: https://tomesphere.com/paper/PMC12528719