# Exploring predictive factors of physiological, biochemical indicators, and lifestyle for macrovascular complications in type 2 diabetes: a synthesis of machine learning models

**Authors:** Bo Shang, Yaoqin Lu, Chengjing Wei, Yunlong Li, Qingyue Yang, Yukai Li, Sheng Jiang, Yinxia Su

PMC · DOI: 10.3389/fendo.2026.1696240 · Frontiers in Endocrinology · 2026-02-17

## TL;DR

This study uses machine learning to identify key factors predicting macrovascular complications in type 2 diabetes patients, highlighting the importance of BMI, body temperature, and lifestyle behaviors.

## Contribution

The study introduces a novel synthesis of machine learning models to predict macrovascular complications in T2DM, emphasizing overlooked variables like body temperature and health behavior discrepancies.

## Key findings

- XGBoost outperformed other models with a validation AUC of 0.850, showing superior clinical net benefit.
- BMI, body temperature, and LDL-C were identified as key predictive factors for macrovascular complications.
- Patients with complications showed a disconnect between health behaviors and self-reported health status.

## Abstract

Traditional risk models for macrovascular complications in type 2 diabetes (T2DM) rely on physiological and biochemical indicators, which may lack long-term follow-up data and thus potentially overlook key variables.

A retrospective cohort study was conducted on 4,186 T2DM patients from the Diabetes Health Management Platform in Hotan, Xinjiang, covering the period from 2015 to 2023. Eight machine learning (ML) algorithms were used, with an 8:2 random split into training (n=3,348) and validation (n=838) sets. Performance was evaluated using the area under the receiver operating characteristic curve (AUC), and feature contributions were analyzed using SHAP values. The clinical applicability was verified through decision curve analysis.

The T2DM with macrovascular complications group had significantly higher waist circumference, oropharyngeal abnormalities, and absent lung crackles (P < 0.05). The T2DM with macrovascular complications group also had significantly higher BMI, body temperature, and ALT (P < 0.05), but lower fasting blood glucose, with borderline abnormalities in blood urea and AST. The T2DM with macrovascular complications group had higher smoking, alcohol consumption, and exercise frequency (P < 0.05), but a reverse trend in self-reported “poor” health status (P < 0.05). Among all machine learning models (training AUC 0.68-0.85), XGBoost performed best (training AUC = 0.830, validation AUC = 0.850), with superior clinical net benefit compared to traditional strategies. SHAP analysis revealed that BMI (contribution +0.1116), body temperature (+0.0923), and LDL-C (+0.0821) were key predictive factors, with elevated body temperature potentially indicating subclinical inflammation activation.

Among patients with vascular complications, the disconnect between health behavior risks and subjective health perception is more pronounced. Elevated body temperature, high blood pressure, triglycerides, and fasting glucose indicate inflammation, increasing cardiovascular risk; moderate regular exercise provides protection.

## Linked entities

- **Diseases:** type 2 diabetes (MONDO:0005148)

## Full-text entities

- **Genes:** GPT (glutamic--pyruvic transaminase) [NCBI Gene 2875] {aka AAT1, ALT, ALT1, GPT1, SGPT}, TNF (tumor necrosis factor) [NCBI Gene 7124] {aka DIF, IMD127, TNF-alpha, TNFA, TNFSF2, TNLG1F}, GLP1R (glucagon like peptide 1 receptor) [NCBI Gene 2740] {aka GLP-1, GLP-1-R, GLP-1R}, INS (insulin) [NCBI Gene 3630] {aka IDDM, IDDM1, IDDM2, ILPR, IRDN, MODY10}, SHROOM4 (shroom family member 4) [NCBI Gene 57477] {aka MRXSSDS, SHAP, shrm4}, SLC17A5 (solute carrier family 17 member 5) [NCBI Gene 26503] {aka AST, ISSD, NSD, SD, SIALIN, SIASD}, ITIH1 (inter-alpha-trypsin inhibitor heavy chain 1) [NCBI Gene 3697] {aka H1P, IATIH, ITI-HC1, ITIH, SHAP}, IL6 (interleukin 6) [NCBI Gene 3569] {aka BSF-2, BSF2, CDF, HGF, HSF, IFN-beta-2}, COG2 (component of oligomeric golgi complex 2) [NCBI Gene 22796] {aka CDG2Q, LDLC}, ALB (albumin) [NCBI Gene 213] {aka FDAHT, HSA, PRO0883, PRO0903, PRO1341}
- **Diseases:** cerebrovascular accidents (MESH:D020521), polyphagia (MESH:D006963), vascular complications (MESH:D003925), obesity (MESH:D009765), hypoglycemic (MESH:C000721848), Hearing Ability (MESH:D034381), metabolic abnormalities (MESH:D008659), dyslipidemia (MESH:D050171), hyperglycemia (MESH:D006943), macrovascular disease (MESH:D004194), HL (MESH:C538324), macrovascular complications (MESH:D008107), chronic inflammation (MESH:D007249), pancreatic dysfunction (MESH:D010195), oropharyngeal abnormalities (MESH:D009959), polyuria (MESH:D011141), Diabetes (MESH:D003920), pharyngeal structural abnormalities (MESH:D010608), endothelial dysfunction (MESH:D014652), lactic acidosis (MESH:D000140), arteriosclerosis (MESH:D001161), respiratory abnormalities (MESH:D015619), coronary artery disease (MESH:D003324), diabetic ketoacidosis (MESH:D016883), heart disease (MESH:D006331), diabetic coma (MESH:D003926), wheezing (MESH:D012135), kidney function impairment (MESH:D007674), vascular degeneration (MESH:D009410), peripheral artery disease (MESH:D058729), T2DM (MESH:D003924), neuropathy (MESH:D009422), diabetic nephropathy (MESH:D003928), impaired glucose tolerance (MESH:D018149), Hepatomegaly (MESH:D006529), atherosclerosis (MESH:D050197), hypertension (MESH:D006973), C (OMIM:211750), diabetic complications (MESH:D048909), diabetic foot (MESH:D017719), Diastolic Blood Pressure (MESH:D006337), Insulin resistance (MESH:D007333), cerebrovascular disease (MESH:D002561), vascular damage (MESH:D057772), polydipsia (MESH:D059606), weight loss (MESH:D015431), diabetic retinopathy (MESH:D003930), diabetic neuropathy (MESH:D003929), abnormal fasting glucose (MESH:D007003), Cardiovascular Disease (MESH:D002318)
- **Chemicals:** Blood glucose (MESH:D001786), Cholesterol (MESH:D002784), TC (MESH:D013667), TG (MESH:D014280), Bilirubin (MESH:D001663), lipid (MESH:D008055), Glucose (MESH:D005947), Creatinine (MESH:D003404), alcohol (MESH:D000438), LDL-C (-), Urea (MESH:D014508)
- **Species:** Homo sapiens (human, species) [taxon 9606]

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/PMC12955086/full.md

## Figures

5 figures with captions in the complete paper: https://tomesphere.com/paper/PMC12955086/full.md

## References

72 references — full list in the complete paper: https://tomesphere.com/paper/PMC12955086/full.md

---
Source: https://tomesphere.com/paper/PMC12955086