# Research on a Predictive Model for Microalbuminuria in Type 2 Diabetes Based on Machine Learning and SHAP Analysis

**Authors:** Zixuan Liu, Zhuolin Zhou, Yu Sun, Xiaotian Du, Haixia Zhang, Cheng Ji

PMC · DOI: 10.1155/ije/6356560 · International Journal of Endocrinology · 2026-03-01

## TL;DR

This study uses machine learning to predict microalbuminuria in type 2 diabetes patients and identifies key risk factors using SHAP analysis for better clinical decision-making.

## Contribution

The novel contribution is an interpretable machine learning model for early microalbuminuria risk prediction in T2DM patients using SHAP analysis.

## Key findings

- The Light GBM model achieved an AUC of 0.85 for predicting microalbuminuria progression.
- SHAP analysis identified ten key features contributing to microalbuminuria risk prediction.

## Abstract

Type 2 diabetes mellitus (T2DM) is associated with kidney damage, with microalbuminuria (MAU) serving as an early marker indicating the risk of progression to severe renal and cardiovascular complications, and there is an urgent need for effective prediction tools to identify MAU risk in T2DM patients and prevent adverse outcomes. This study aims to develop a machine learning–based model to enhance the early identification of high‐risk individuals and facilitate timely, personalized interventions.

The electronic medical records of 4170 patients were retrospectively extracted from the diabetes special database of Nanjing Drum Tower Hospital (Ethics approval number: 2021‐403‐02). The data were divided into training and testing sets (8:2 ratio), and random forest–based recursive feature elimination method was employed to identify the most pertinent input variables for the predictive model. Five machine learning models were applied to predict the progression to MAU. The Shapley additive explanations (SHAP) values were applied for model interpretation to assess feature contributions. Ten features were selected for the construction of a prediction model.

For predicting the progression to MAU, the Light GBM model demonstrated the best performance (AUC 0.85, 95% CI 0.82–0.88). By analyzing the Shapley values of the model outputs, we identified the key risk factors for predicting the diagnosis of MAU at both the cohort and individual levels.

This study developed an interpretable machine learning model to predict MAU in T2DM patients, enabling effective risk stratification and identification of high‐risk individuals based on baseline data to guide personalized clinical interventions and optimization of treatment.

## Linked entities

- **Diseases:** Type 2 diabetes mellitus (MONDO:0005148)

## Full-text entities

- **Genes:** ALPP (alkaline phosphatase, placental) [NCBI Gene 250] {aka ALP, PALP, PLAP, PLAP-1}, ITIH2 (inter-alpha-trypsin inhibitor heavy chain 2) [NCBI Gene 3698] {aka H2P, ITI-HC2, SHAP}, GLP1R (glucagon like peptide 1 receptor) [NCBI Gene 2740] {aka GLP-1, GLP-1-R, GLP-1R}, INS (insulin) [NCBI Gene 3630] {aka IDDM, IDDM1, IDDM2, ILPR, IRDN, MODY10}, ALB (albumin) [NCBI Gene 213] {aka FDAHT, HSA, PRO0883, PRO0903, PRO1341}
- **Diseases:** acute kidney injury (MESH:D058186), metabolic disorder (MESH:D008659), Proteinuria (MESH:D011507), Dyslipidemia (MESH:D050171), inflammation (MESH:D007249), metabolic syndrome (MESH:D024821), hyperglycemia (MESH:D006943), NLR (MESH:D015467), GBM (MESH:D005910), interstitial fibrosis (MESH:D005355), renal function decline (MESH:D060825), malignant tumors (MESH:D009369), Vascular Disease (MESH:D014652), Diabetes (MESH:D003920), liver dysfunction (MESH:D017093), T2DM (MESH:D003924), glomerular damage (MESH:D007674), diabetic kidney damage (MESH:D003928), Hypertension (MESH:D006973), glomerulonephritis (MESH:D005921), anemia (MESH:D000740), albuminuria (MESH:D000419), insulin resistance (MESH:D007333), cardiovascular and renal terminal organ damage (MESH:D002318), end-stage renal disease (MESH:D007676)
- **Chemicals:** metformin (MESH:D008687), cholesterol (MESH:D002784), C-peptide (MESH:D002096), blood glucose (MESH:D001786), P (MESH:D010758), Met (MESH:D008715), TG (MESH:D014280), UA (MESH:D014527), lipid (MESH:D008055), Mg (MESH:D008274), creatinine (MESH:D003404), glucose (MESH:D005947), Ca (MESH:D002118), potassium (MESH:D011188), GAG (MESH:D006025), Na (MESH:D012964), ACEi (-)
- **Species:** Mus musculus (house mouse, species) [taxon 10090], Homo sapiens (human, species) [taxon 9606]
- **Mutations:** AUC of 0, A1C

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/PMC12950826/full.md

## Figures

4 figures with captions in the complete paper: https://tomesphere.com/paper/PMC12950826/full.md

## References

45 references — full list in the complete paper: https://tomesphere.com/paper/PMC12950826/full.md

---
Source: https://tomesphere.com/paper/PMC12950826