# Construction and validation of a cross-sectional risk classification model for hypoproteinemia in single-center maintenance hemodialysis patient

**Authors:** Kun Zhang, Xiaohan Qu, Shuai Xu, Lu Xu, Xinjian Li, Lei Liu

PMC · DOI: 10.1038/s41598-025-19913-8 · Scientific Reports · 2025-10-15

## TL;DR

This study builds a machine learning model to predict hypoproteinemia risk in hemodialysis patients, aiming to improve early clinical interventions.

## Contribution

A novel SVM-based classification model with SHAP interpretation for hypoproteinemia risk in MHD patients is developed and validated.

## Key findings

- The SVM model achieved an AUC of 0.937 in predicting hypoproteinemia risk.
- Thirteen key features, including globulin and hemoglobin, were identified as significant predictors.
- SHAP interpretation enhanced model transparency and supported personalized risk classification.

## Abstract

Hypoproteinemia is a common complication across patients receiving maintenance hemodialysis (MHD). Moreover, it is associated with increased risks of cardiovascular events, infection risk, and mortality. This study aimed to construct a classification model for identifying the risk of hypoproteinemia in MHD patients to support precise clinical interventions. To this end, a retrospective analysis was conducted on 288 MHD patients at the Affiliated Hospital of Jining Medical University (January–December 2023). Hypoproteinemia was defined as the primary outcome. Four machine learning (ML) models—Logistic Regression (LR), Random Forest (RF), Support Vector Machine (SVM), and eXtreme Gradient Boosting (XGBoost)—were trained and validated using 3-fold cross-validation. Feature selection was performed using the Least Absolute Shrinkage and Selection Operator (LASSO). Model performance was evaluated using area under the receiver operating characteristic curve (AUC), accuracy, sensitivity, specificity, balanced F-score, and Brier score. SHapley Additive exPlanations (SHAP) was used to interpret the model. The SVM model demonstrated the highest identification performance with an AUC of 0.937. Thirteen key features were identified, with globulin, hemoglobin, β2 microglobulin, prealbumin, and hemodialysis mode being the most significant. In conclusion, the classification model based on ML can accurately identify the risk of hypoproteinemia in MHD patients and provide targeted guidance for early clinical intervention. The combination of SVM and SHAP not only enhances model interpretability but also establishes a scientific foundation for personalized risk classification and the development of precision diagnosis and treatment strategies by visualizing the dynamic effects of key features.

The online version contains supplementary material available at 10.1038/s41598-025-19913-8.

## Full-text entities

- **Genes:** HLA-G (major histocompatibility complex, class I, G) [NCBI Gene 3135] {aka MHC-G}
- **Diseases:** Hypoproteinemia (MESH:D007019), infection (MESH:D007239)
- **Species:** Homo sapiens (human, species) [taxon 9606]

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/PMC12528663/full.md

## Figures

5 figures with captions in the complete paper: https://tomesphere.com/paper/PMC12528663/full.md

## References

1 references — full list in the complete paper: https://tomesphere.com/paper/PMC12528663/full.md

---
Source: https://tomesphere.com/paper/PMC12528663