# A machine learning-based model analysis for serum markers of liver fibrosis in chronic hepatitis B patients

**Authors:** Congjie Zhang, Zhenyu Shu, Shanshan Chen, Jiaxuan Peng, Yueyue Zhao, Xuan Dai, Jie Li, Xuehan Zou, Jianhua Hu, Haijun Huang

PMC · DOI: 10.1038/s41598-024-63095-8 · Scientific Reports · 2024-05-27

## TL;DR

This study uses machine learning to predict liver fibrosis stages in chronic hepatitis B patients using five serum markers, offering a reliable and accurate diagnostic tool.

## Contribution

A novel machine learning model using five serum markers for accurate liver fibrosis staging in chronic hepatitis B patients.

## Key findings

- The decision tree model achieved high AUC values (0.898–0.944) for fibrosis staging in the training cohort.
- The model showed strong external validation performance with AUC values of 0.906–0.933.
- The model's classifications closely matched pathological diagnoses, supporting its clinical utility.

## Abstract

Early assessment and accurate staging of liver fibrosis may be of great help for clinical diagnosis and treatment in patients with chronic hepatitis B (CHB). We aimed to identify serum markers and construct a machine learning (ML) model to reliably predict the stage of fibrosis in CHB patients. The clinical data of 618 CHB patients between February 2017 and September 2021 from Zhejiang Provincial People's Hospital were retrospectively analyzed, and these data as a training cohort to build the model. Six ML models were constructed based on logistic regression, support vector machine, Bayes, K-nearest neighbor, decision tree (DT) and random forest by using the maximum relevance minimum redundancy (mRMR) and gradient boosting decision tree (GBDT) dimensionality reduction selected features on the training cohort. Then, the resampling method was used to select the optimal ML model. In addition, a total of 571 patients from another hospital were used as an external validation cohort to verify the performance of the model. The DT model constructed based on five serological biomarkers included HBV-DNA, platelet, thrombin time, international normalized ratio and albumin, with the area under curve (AUC) values of the DT model for assessment of liver fibrosis stages (F0-1, F2, F3 and F4) in the training cohort were 0.898, 0.891, 0.907 and 0.944, respectively. The AUC values of the DT model for assessment of liver fibrosis stages (F0-1, F2, F3 and F4) in the external validation cohort were 0.906, 0.876, 0.931 and 0.933, respectively. The simulated risk classification based on the cutoff value showed that the classification performance of the DT model in distinguishing hepatic fibrosis stages can be accurately matched with pathological diagnosis results. ML model of five serum markers allows for accurate diagnosis of hepatic fibrosis stages, and beneficial for the clinical monitoring and treatment of CHB patients.

## Linked entities

- **Diseases:** chronic hepatitis B (MONDO:0005344)

## Full-text entities

- **Genes:** F2 (coagulation factor II, thrombin) [NCBI Gene 2147] {aka PT, RPRGL2, THPH1}
- **Diseases:** hepatic fibrosis (MESH:D008103), fibrosis (MESH:D005355), CHB (MESH:D019694)
- **Species:** Homo sapiens (human, species) [taxon 9606]

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/PMC11130122/full.md

## Figures

6 figures with captions in the complete paper: https://tomesphere.com/paper/PMC11130122/full.md

## References

31 references — full list in the complete paper: https://tomesphere.com/paper/PMC11130122/full.md

---
Source: https://tomesphere.com/paper/PMC11130122