# Machine learning model based on routine blood and biochemical parameters for early diagnosis of diabetic kidney disease

**Authors:** Wei Yong, Dan-dan Peng, Kai Ye, Jun-jie Gao, Ruo-xue Cao

PMC · DOI: 10.3389/fendo.2026.1720574 · Frontiers in Endocrinology · 2026-01-28

## TL;DR

This study developed a machine learning model using routine blood tests to detect early diabetic kidney disease, which could improve early diagnosis and treatment.

## Contribution

The novel contribution is developing and validating a machine learning model for early DKD detection using routine clinical parameters.

## Key findings

- Logistic regression achieved optimal performance with an AUC of 0.689 in detecting early DKD.
- Key predictors included the triglyceride-glucose index (TyG), HbA1c, and globulin.

## Abstract

Diabetic kidney disease (DKD) is the leading cause of end-stage renal disease globally, yet early diagnosis remains challenging due to conventional biomarker limitations, including UACR variability and reduced eGFR sensitivity. While machine learning shows promise in diabetes prediction, its application to early DKD identification using routine parameters remains underexplored. This study aimed to develop and validate machine learning models incorporating routine blood and biochemical parameters for early DKD prediction.

This retrospective study analyzed 3,114 diabetic patients from the Second Affiliated Hospital of Wannan Medical College (EDN1) and 1,496 patients from NHANES 2005-2018 (EDN2) for external validation. Early DKD was defined as UACR 30–300 mg/g with eGFR ≥60 ml/min/1.73m². Seven machine learning algorithms were compared. Feature importance was assessed using SHAP framework, and Mendelian randomization explored causal relationships.

Among 3,114 patients, 1,333 (42.8%) had early DKD. Logistic regression achieved optimal performance (AUC = 0.689, sensitivity=40.5%, specificity=81.3%). Top predictors included triglyceride-glucose index (TyG), gender, creatinine, globulin, and age. External validation confirmed significant associations for HbA1c, globulin, TyG, and neutrophil-to-albumin ratio.

The machine learning model successfully identified early DKD using routine parameters, with TyG index, HbA1c, and globulin as key predictors, demonstrating potential as a cost-effective screening tool.

## Linked entities

- **Diseases:** diabetic kidney disease (MONDO:0005016), end-stage renal disease (MONDO:0004375)

## Full-text entities

- **Genes:** ALB (albumin) [NCBI Gene 213] {aka FDAHT, HSA, PRO0883, PRO0903, PRO1341}
- **Diseases:** end-stage renal disease (MESH:D007676), diabetes (MESH:D003920), DKD (MESH:D003928)
- **Chemicals:** triglyceride (MESH:D014280), creatinine (MESH:D003404), glucose (MESH:D005947)
- **Species:** Homo sapiens (human, species) [taxon 9606]

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/PMC12890677/full.md

## Figures

4 figures with captions in the complete paper: https://tomesphere.com/paper/PMC12890677/full.md

## References

34 references — full list in the complete paper: https://tomesphere.com/paper/PMC12890677/full.md

---
Source: https://tomesphere.com/paper/PMC12890677