# Machine learning models based on routine blood and biochemical test data for diagnosis of neurological diseases

**Authors:** Wanshan Ning, Zhicheng Wang, Ying Gu, Lindan Huang, Shuai Liu, Qun Chen, Yunyun Yang, Guolin Hong

PMC · DOI: 10.1038/s41598-025-09439-4 · Scientific Reports · 2025-07-30

## TL;DR

This study uses machine learning on blood test data to diagnose neurological diseases, showing high accuracy and cost-effectiveness.

## Contribution

A novel XGBoost-based model using routine blood and biochemical data achieves high diagnostic accuracy for neurological diseases.

## Key findings

- The XGBoost model achieved an AUC of 0.9782 for predicting neurological diseases.
- The model distinguishing neuromyelitis optica had the best AUC of 0.9095.
- Biochemical test features were most important for model predictions.

## Abstract

Globally, nervous system diseases are the leading cause of disability-adjusted life-years and the second leading cause of mortality in the world. Traditional diagnostic methods for nervous system diseases are expensive. So this study aimed to construct machine learning models using the convenient blood routine and biochemical detection data for diagnosis of nervous system diseases. After the data preprocessing, 25,794 healthy people and 7518 nervous system disease patients with the blood routine and biochemical detection data were utilized for our study. We selected logistic regression, random forest, support vector machine, eXtreme Gradient Boosting (XGBoost), and deep neural network to construct models. Finally, the SHAP algorithm was used to interpret models. The nervous system disease prediction model constructed by XGBoost possessed the best performance (AUC: 0.9782). And the most models of distinguishing various nervous system diseases also had good performance, the model performance of distinguishing neuromyelitis optica from other nervous system diseases was the best (AUC: 0.9095). The model interpretation by SHAP algorithm indicated features from biochemical detection made major contributions to predicting nervous system disease. The present study constructed multiple models using 52 features from the blood routine and biochemical detection data for diagnosis of various nervous system diseases. Meanwhile, distinct hematologic features of various nervous system diseases also were explored. This cost-effective work will benefit more people and assist in diagnosis and prevention of nervous system diseases.

## Linked entities

- **Diseases:** neuromyelitis optica (MONDO:0019100)

## Full-text entities

- **Genes:** SHROOM4 (shroom family member 4) [NCBI Gene 57477] {aka MRXSSDS, SHAP, shrm4}
- **Diseases:** neuromyelitis optica (MESH:D009471), nervous system disease (MESH:D009422), neurological diseases (MESH:D020271)
- **Species:** Homo sapiens (human, species) [taxon 9606]

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/PMC12311004/full.md

## Figures

5 figures with captions in the complete paper: https://tomesphere.com/paper/PMC12311004/full.md

## References

8 references — full list in the complete paper: https://tomesphere.com/paper/PMC12311004/full.md

---
Source: https://tomesphere.com/paper/PMC12311004