# Screening for Clinically Significant Nephrolithiasis Based on Simple Health Checkup Clinical and Urine Parameters in General Populations: Multicenter Machine Learning Study

**Authors:** Hao-Wei Chen, Pei-Siou Wei, Yu-Chen Chen, Jeng-Yih Wu, Chia-I Lin, Yii-Her Chou, Yung-Shun Juan, Wen-Jeng Wu, Chung-Yao Kao, Jung-Ting Lee

PMC · DOI: 10.2196/80764 · JMIR Medical Informatics · 2026-02-19

## TL;DR

This study developed a machine learning model to screen for kidney stones using simple health data, offering a low-cost and non-invasive alternative to traditional imaging methods.

## Contribution

A novel machine learning model for kidney stone screening using routine clinical and urine parameters in general populations.

## Key findings

- The best-performing model achieved an AUC-ROC of 0.968 and AUC-PR of 0.936 for predicting clinically significant nephrolithiasis.
- Urine red blood cell count, estimated glomerular filtration rate, and urine specific gravity were the top predictors identified by Shapley value analysis.
- The model offers a non-invasive, scalable solution for kidney stone screening, suitable for integration into health checkups or telemedicine.

## Abstract

Nephrolithiasis affects approximately 15% of the population and often remains undetected in asymptomatic individuals. Current diagnostic approaches rely on imaging tools, such as ultrasound or computed tomography, which are costly, operator dependent, or involve radiation, making them unsuitable for large-scale screening. A standardized, practical, and low-cost screening strategy for early identification of clinically significant kidney stones is still lacking.

This study aimed to develop a low-cost, rapid screening model for clinically significant nephrolithiasis using machine learning (ML) and simple clinical parameters.

We conducted a multihospital retrospective study using data from 3 hospitals in Kaohsiung, Taiwan (2012‐2021). Adults without renal colic were included. ML models were trained and tested using 10 routine variables: sex, age, BMI, gout, diabetes, estimated glomerular filtration rate, urine pH, red blood cell count, specific gravity, and bacteriuria. Multiple ML algorithms were trained and evaluated, and the best-performing model was selected based on the area under the receiver operating characteristic curve and the area under the precision-recall curve. To assess model interpretability, Shapley value analysis was performed to determine the relative importance and contribution of each variable to the model’s predictive performance.

Among 6528 participants, the best-performing model achieved an area under the receiver operating characteristic curve of 0.968 (95% CI 0.956‐0.980), an area under the precision-recall curve of 0.936 (95% CI 0.918‐0.953), a sensitivity of 0.873 (95% CI 0.841‐0.904), and a specificity of 0.947 (95% CI 0.935‐0.959). Shapley value analysis identified urine red blood cell count, estimated glomerular filtration rate, and urine specific gravity as the 3 most influential predictors.

This ML-based model enables efficient, noninvasive, and large-scale kidney stone screening using routine health data. It can be integrated into health checkups or telemedicine platforms to facilitate early detection and proactive management. Although the model was developed using an Asian population, future validation in diverse cohorts is warranted to confirm its generalizability.

## Linked entities

- **Diseases:** nephrolithiasis (MONDO:0008171), gout (MONDO:0005393), diabetes (MONDO:0005015)

## Full-text entities

- **Diseases:** overweight (MESH:D050177), obese (MESH:D009765), Bacteriuria (MESH:D001437), chronic kidney disease (MESH:D051436), cancers (MESH:D009369), diabetes (MESH:D003920), gout (MESH:D006073), hematuria (MESH:D006417), flank pain (MESH:D021501), Nephrolithiasis (MESH:D053040), sepsis (MESH:D018805), renal colic (MESH:D056844), Renal Disease (MESH:D007674), urolithiasis (MESH:D052878), urinary tract infections (MESH:D014552), kidney and ureteral stones (MESH:D007669), colorectal cancer (MESH:D015179)
- **Chemicals:** oxalate (MESH:D010070), calcium (MESH:D002118), creatinine (MESH:D003404), citrate (MESH:D019343)
- **Species:** Homo sapiens (human, species) [taxon 9606]

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/PMC12919908/full.md

## Figures

5 figures with captions in the complete paper: https://tomesphere.com/paper/PMC12919908/full.md

## References

27 references — full list in the complete paper: https://tomesphere.com/paper/PMC12919908/full.md

---
Source: https://tomesphere.com/paper/PMC12919908