# Forecasting Stone-Free Status Following Percutaneous Nephrolithotomy Utilizing Explainable Machine Learning

**Authors:** Resul Çiçek, İbrahim Topçu, Bulut Dural, İpek Balıkçı Çiçek, Murat Yılmaz, Cemil Çolak

PMC · DOI: 10.3390/jcm15041380 · 2026-02-10

## TL;DR

This study uses explainable machine learning to predict whether patients will be stone-free after a kidney stone surgery called PNL.

## Contribution

The study introduces an explainable machine learning model, particularly XGBoost, for predicting post-PNL stone-free outcomes with clinical transparency.

## Key findings

- XGBoost achieved the highest accuracy (0.916) and ROC–AUC (0.975) among tested models.
- SHAP analysis revealed anatomical anomalies as the strongest predictor of stone-free outcomes.
- Model features like stone placement and operation duration showed clinical relevance.

## Abstract

Background: This study aimed to create and evaluate explainable machine learning models for forecasting postoperative stone-free status following percutaneous nephrolithotomy (PNL) utilizing a substantial clinical cohort. Methods: This retrospective single-center analysis encompassed 2144 adult patients who received PNL from 2010 to 2024. We employed clinical, radiographic, stone-related, and surgical data to train four supervised machine learning models: Extreme Gradient Boosting (XGBoost), Random Forest, Light Gradient Boosting Machine (LightGBM), and Adaptive Boosting (AdaBoost). We used the Synthetic Minority Oversampling Technique exclusively on the training set to fix the class imbalance. We assessed the model’s accuracy, precision, recall, F1-score, and area under the receiver operating characteristic curve (ROC–AUC) to see how well it worked. SHapley Additive exPlanations (SHAP) were used to measure explainability. Results: The total stone-free rate was 84.8%. XGBoost had the best predictive performance of the models tested, with an accuracy of 0.916 and a ROC–AUC of 0.975. LightGBM was close behind. Random Forest and AdaBoost had relatively inferior performance. SHAP analysis identified anatomical anomalies as demonstrated the strongest association with stone-free outcomes. The size of the access sheath and the number of stones were next. Other parameters that were identified by SHAP as important contributors to model predictions were the placement of the stone, Guy’s Stone Score, the length of the operation, and the density of the stone. These feature associations demonstrated clinical coherence with established knowledge in surgical practice. Conclusions: Explainable machine learning algorithms, especially XGBoost, can accurately predict stone-free outcomes following PNL in a way that makes sense to doctors. The incorporation of SHAP improves transparency and facilitates the prospective application of these models as decision-support instruments in personalized surgical planning.

## Full-text entities

- **Genes:** SHROOM4 (shroom family member 4) [NCBI Gene 57477] {aka MRXSSDS, SHAP, shrm4}
- **Diseases:** diabetes mellitus (MESH:D003920), Urolithiasis (MESH:D052878), staghorn calculi (MESH:D000069856), Stone (MESH:D007669), injury to (MESH:D014947), GSS (MESH:D016098), II (MESH:C537730), blood (MESH:D006402), hypertension (MESH:D006973), horseshoe (MESH:D000069337), urological condition (MESH:D014570), congenital renal or collecting system abnormalities (MESH:D002292), bleeding (MESH:D006470), renal malrotation (MESH:C562456), I (MESH:D006969), coronary artery disease (MESH:D003324), ureteropelvic junction obstruction (MESH:C537373)
- **Chemicals:** cystine (MESH:D003553), creatinine (MESH:D003404), brushite (MESH:C494366), calcium oxalate (MESH:D002129)
- **Species:** Homo sapiens (human, species) [taxon 9606]

## Figures

6 figures with captions in the complete paper: https://tomesphere.com/paper/PMC12941648/full.md

---
Source: https://tomesphere.com/paper/PMC12941648