# Interpretable machine learning for chronic kidney disease prediction: Insights from SHAP and LIME analyses

**Authors:** El Mehdi Chouit, Mohamed Rachdi, Mostafa Bellafkih, Brahim Raouyane, Md. Mehedi Hassan, Shahid Akbar, Polat Goktas, Mohammad A. Al-Mamun, Julfikar Haider, Francisco Alvarez Gonzalez, Francisco Alvarez Gonzalez

PMC · DOI: 10.1371/journal.pone.0343205 · PLOS One · 2026-02-26

## TL;DR

This study uses interpretable machine learning to predict chronic kidney disease, offering insights into key factors and improving transparency in healthcare AI.

## Contribution

A novel interpretable machine learning framework for CKD prediction using SHAP and LIME with high accuracy and clinical relevance.

## Key findings

- XGBoost with SMOTE achieved 88.4% accuracy on UAE hospital data and 94.6% on UCI data.
- SHAP identified eGFRBaseline, HbA1c, and CholesterolBaseline as key predictors in the hospital cohort.
- LIME provided patient-level explanations that aligned with global SHAP patterns, confirming model reliability.

## Abstract

Chronic kidney disease (CKD) is a progressive condition requiring early detection for optimal patient outcomes. This study developed an interpretable machine learning framework using XGBoost with SHapley Additive exPlanations (SHAP) and Local Interpretable Model-agnostic Explanations (LIME) for transparent CKD prediction. We evaluated the approach on two datasets: UAE Tawam Hospital data (n = 491) and UCI CKD data (n = 400).

XGBoost with SMOTE optimization achieved 88.4% accuracy (AUC = 0.904) on the hospital dataset and 94.6% accuracy (AUC = 0.948) on the UCI dataset after Rigorous overfitting prevention through conservative hyperparameter ranges and performance monitoring ensured clinical credibility. SHAP analysis identified clinically relevant predictors: eGFRBaseline, HbA1c, and CholesterolBaseline for the hospital cohort, and specific gravity, hemoglobin, and serum creatinine for the UCI cohort. LIME provided complementary patient-level explanations that validated global SHAP patterns.

The convergence between global and local interpretability methods confirms model reliability across diverse clinical contexts. This framework addresses the transparency barrier to machine learning adoption in healthcare while maintaining clinically realistic performance levels. The approach provides a foundation for integrating interpretable artificial intelligence into CKD screening and management workflows.

## Linked entities

- **Diseases:** chronic kidney disease (MONDO:0005300)

## Full-text entities

- **Genes:** LIME1 (Lck interacting transmembrane adaptor 1) [NCBI Gene 54923] {aka LIME, dJ583P15.4}, SHROOM4 (shroom family member 4) [NCBI Gene 57477] {aka MRXSSDS, SHAP, shrm4}, SYNM (synemin) [NCBI Gene 23336] {aka DMN, SYN}, ALB (albumin) [NCBI Gene 213] {aka FDAHT, HSA, PRO0883, PRO0903, PRO1341}
- **Diseases:** inflammatory (MESH:D007249), ANE (MESH:D000740), Disease (MESH:D004194), obesity (MESH:D009765), retinoblastoma (MESH:D012175), Kidney Disease (MESH:D007674), albuminuria (MESH:D000419), DLD (MESH:D050171), SMOTE (MESH:D006963), deaths (MESH:D003643), ML (MESH:D007859), Polycystic Ovary Syndrome (MESH:D011085), HTN (MESH:D006973), CHD (MESH:D003327), dementia (MESH:D003704), CAD (MESH:D003324), renal comorbidity (MESH:D006030), CVD (MESH:D002318), diabetic nephropathy (MESH:D003928), metabolic dysregulation (MESH:D021081), DM (MESH:D003920), ESRD (MESH:D007676), vascular disease (MESH:D014652), COVID-19 (MESH:D000086382), EPS (MESH:D001480), XAI (MESH:C538243), PE (MESH:D004487), CKD (MESH:D051436)
- **Chemicals:** Al (MESH:D000535), -D-24-28794R4 (-), POT (MESH:D011188), SOD (MESH:D012964), urea (MESH:D014508), Cholesterol (MESH:D002784), triglyceride (MESH:D014280), glucose (MESH:D005947), creatinine (MESH:D003404)
- **Species:** Homo sapiens (human, species) [taxon 9606]
- **Mutations:** A1C

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/PMC12944776/full.md

## Figures

11 figures with captions in the complete paper: https://tomesphere.com/paper/PMC12944776/full.md

## References

37 references — full list in the complete paper: https://tomesphere.com/paper/PMC12944776/full.md

---
Source: https://tomesphere.com/paper/PMC12944776