# Predicting preoperative lymph node status in patients with cervical cancer: development of interpretable machine learning model and support for the biological plausibility

**Authors:** Hui Shen, Yuting Jiang, Lihe Zhang, Qiao Zheng, Han Bai, Lihong Wu, Liu Du, Hongning Xie

PMC · DOI: 10.3389/fimmu.2025.1654332 · Frontiers in Immunology · 2025-10-10

## TL;DR

This study develops interpretable machine learning models to predict lymph node metastasis in cervical cancer patients and supports the biological role of monocytes in cancer progression.

## Contribution

The novel contribution is the development of interpretable machine learning models validated with both bulk and single-cell RNA-sequencing data to predict lymph node metastasis in cervical cancer.

## Key findings

- Monocyte count was identified as a significant risk factor for lymph node metastasis (OR=2.28, p < 0.05).
- Single-cell RNA-sequencing showed increased monocytes at IIIC1 stage compared to earlier stages.
- The NNET model achieved strong predictive performance (AUC up to 0.86) and was validated on prospective data.

## Abstract

Lymph node metastasis serves as a crucial prognostic risk factor for patients with cervical cancer. Accurate prediction of lymph node metastasis is important in guiding treatment selection. Therefore, our primary objective is the development and validation of machine learning models for predicting lymph node metastasis; the secondary objective is to utilize the sequencing data to provide biological plausibility.

This study retrospectively included 292 cervical cancer patients and prospectively recruited 54 cervical cancer patients. Univariate and multivariate analysis were conducted to explore the risk factors associated with lymph node metastasis. Subsequently, cellular-level validation was performed using single cell RNA-sequencing data. The prognostic value of the risk factor was assessed through bulk RNA-sequencing analysis. Finally, patients were divided into train and retrospective test sets in a 7:3 ratio to develop five machine learning models, while using the prospective test set to validate the models. Additionally, the Shapley Additive Explanation method was employed to enhance the interpretability of the models’ decision processes.

Federation of Gynecology and Obstetrics stage (2018), squamous cell carcinoma antigen, monocyte count and platelet count were found to be significantly correlated with lymph node metastasis. Meanwhile, monocyte count was a significant risk factor (OR=2.28, p < 0.05). Single cell RNA-sequencing analysis revealed an increase in monocytes at IIIC1 stage compared to IB and IIB stages. Monocytes were significantly associated with prognosis and lymph node metastasis in the bulk RNA-sequencing. Finally, we developed and validated five machine learning models for predicting lymph node metastasis. The NNET model stood for its ability to predict lymph node metastasis (train set AUC: 0.86; retrospective test set AUC: 0.79; prospective test set: 0.76). In the interpretability of machine learning models, Shapley Additive Explanation values demonstrated the concrete contribution of each feature within the NNET model.

This study investigated the notable association between monocyte count and lymph node metastasis, highlighting the importance of monocytes in cervical cancer via bulk RNA-sequencing and single cell RNA-sequencing analysis. The developed interpretable machine learning models effectively aid clinicians in decision-making processes. Additionally, the Shapley Additive Explanation method improved the applicability of these machine learning models in real world.

## Linked entities

- **Diseases:** cervical cancer (MONDO:0002974)

## Full-text entities

- **Diseases:** Lymph node metastasis (MESH:D008207), cervical cancer (MESH:D002583), squamous cell carcinoma (MESH:D002294)
- **Species:** Homo sapiens (human, species) [taxon 9606]

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/PMC12549587/full.md

## Figures

5 figures with captions in the complete paper: https://tomesphere.com/paper/PMC12549587/full.md

## References

33 references — full list in the complete paper: https://tomesphere.com/paper/PMC12549587/full.md

---
Source: https://tomesphere.com/paper/PMC12549587