# Machine learning for predicting distant metastasis in nasopharyngeal carcinoma patients

**Authors:** Hong Sun, Jijie Zhu, Ling Li, Xiu Xin, Jingchao Yan, Taomin Huang

PMC · DOI: 10.3389/fimmu.2025.1580200 · Frontiers in Immunology · 2025-06-05

## TL;DR

This study uses machine learning to identify risk factors for distant metastasis in nasopharyngeal carcinoma patients, so that high-risk individuals can be flagged for early intervention.

## Contribution

The contribution is the application of machine learning to predict distant metastasis in nasopharyngeal carcinoma (NPC) patients and the identification of key risk factors through SHAP analysis.

## Key findings

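- Among the seven models compared, Logistic Regression achieved the best predictive performance on the test dataset (AUC = 0.8499).
- Key risk factors include targeted therapy, immunotherapy, N stage, Epstein-Barr virus (EBV), hypertension, T stage, lymphocyte count (LY), and lactate dehydrogenase (LDH) level.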

## Abstract

Distant metastasis is the main cause of treatment failure and death in patients with nasopharyngeal carcinoma (NPC). The aim of this study was to explore the risk factors for distant metastasis in NPC patients using machine learning (ML) methods.

We collected data from NPC patients who were treated at the Eye Ear Nose Throat Hospital of Fudan University between September 2017 and June 2024. Seven ML methods were employed to construct the predictive models. By comparing the predictive performance of different ML models, the best one was selected to establish a predictive model for distant metastasis of NPC. The SHapley Additive exPlanation (SHAP) method was utilized to ascertain the ranking of feature importance and to provide explanations for the predictive model.
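The model-selection step described above (score several candidate models on a held-out test set, then keep the one with the highest AUC) can be sketched as follows. This is a minimal illustration, not the authors' pipeline: the labels, the model names `model_a` and `model_b`, and their scores are made-up toy values, and the helpers `roc_auc` and `select_best_model` are hypothetical names introduced here. AUC is computed as the fraction of positive/negative pairs in which the positive case receives the higher score, with ties counted as one half.

```python
from typing import Dict, Sequence, Tuple


def roc_auc(labels: Sequence[int], scores: Sequence[float]) -> float:
    """AUC as the probability that a random positive outranks a random negative.

    A tie between a positive and a negative score counts as one half.
    """
    pos = [s for y, s in zip(labels, scores) if y == 1]
    neg = [s for y, s in zip(labels, scores) if y == 0]
    if not pos or not neg:
        raise ValueError("AUC needs at least one positive and one negative label")
    wins = 0.0
    for p in pos:
        for n in neg:
            if p > n:
                wins += 1.0
            elif p == n:
                wins += 0.5
    return wins / (len(pos) * len(neg))


def select_best_model(
    labels: Sequence[int], model_scores: Dict[str, Sequence[float]]
) -> Tuple[str, Dict[str, float]]:
    """Return the name of the model with the highest test AUC, plus all AUCs."""
    aucs = {name: roc_auc(labels, s) for name, s in model_scores.items()}
    best = max(aucs, key=lambda name: aucs[name])
    return best, aucs


# Toy test-set labels (1 = distant metastasis) and predicted risks.
# These values are illustrative assumptions, not data from the study.
y_test = [0, 0, 1, 0, 1, 1, 0, 1]
toy_scores = {
    "model_a": [0.1, 0.4, 0.35, 0.2, 0.8, 0.7, 0.3, 0.9],
    "model_b": [0.5, 0.6, 0.4, 0.3, 0.7, 0.2, 0.8, 0.9],
}

best_name, aucs = select_best_model(y_test, toy_scores)
print(best_name, aucs)
```

The pairwise loop is quadratic in the number of patients, which is fine for a sketch; a library routine such as scikit-learn's `roc_auc_score` would be the usual choice in practice.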

A total of 1,845 NPC patients were included in this study. Among the seven models, Logistic Regression (LR) performed best in the test dataset (Area Under the ROC Curve [AUC] = 0.8499). SHAP analysis indicated that the most important variables for distant metastasis in NPC patients were targeted therapy, immunotherapy, N stage, Epstein-Barr virus (EBV), hypertension, T stage, lymphocyte count (LY) and lactate dehydrogenase (LDH) level.
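To make the SHAP ranking concrete for a linear model such as logistic regression, the sketch below uses the closed form that holds when features are treated as independent: in log-odds space, the SHAP value of feature j for a sample x is w_j (x_j − mean_j). Averaging the absolute values over samples gives a global importance ranking. The summary does not state which SHAP explainer the authors used, so this is an assumption; the weights, feature names, and data are hypothetical toy values, and `linear_shap_values` and `mean_abs_shap` are names introduced for illustration.

```python
from typing import List, Sequence


def linear_shap_values(
    weights: Sequence[float],
    background: Sequence[Sequence[float]],
    x: Sequence[float],
) -> List[float]:
    """SHAP values of a linear model in log-odds space, assuming independent features.

    phi_j = w_j * (x_j - mean of feature j over the background data).
    """
    means = [sum(col) / len(col) for col in zip(*background)]
    return [w * (xi - m) for w, xi, m in zip(weights, x, means)]


def mean_abs_shap(
    weights: Sequence[float],
    background: Sequence[Sequence[float]],
    samples: Sequence[Sequence[float]],
) -> List[float]:
    """Global importance per feature: mean absolute SHAP value over the samples."""
    totals = [0.0] * len(weights)
    for x in samples:
        for j, phi in enumerate(linear_shap_values(weights, background, x)):
            totals[j] += abs(phi)
    return [t / len(samples) for t in totals]


# Hypothetical features, coefficients, and data (not taken from the paper).
feature_names = ["feature_a", "feature_b", "feature_c"]
toy_weights = [2.0, -0.5, 0.1]
toy_X = [
    [0.0, 1.0, 10.0],
    [1.0, 3.0, 12.0],
    [1.0, 2.0, 8.0],
    [0.0, 2.0, 10.0],
]

importance = mean_abs_shap(toy_weights, toy_X, toy_X)
ranking = sorted(zip(feature_names, importance), key=lambda kv: kv[1], reverse=True)
print(ranking)
```

For any single sample, these values sum to the difference between that sample's log-odds and the average log-odds over the background data, which is the additivity property that makes SHAP attributions interpretable.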

Targeted therapy, N stage, immunotherapy, EBV, hypertension, T stage, LY and LDH level are significantly associated with the risk of distant metastasis in NPC and could be used to identify high-risk populations for distant metastasis in NPC patients. For high-risk patients, early interventions such as targeted therapy and immunotherapy might be considered to reduce the risk of distant metastasis in NPC.

## Linked entities

- **Diseases:** nasopharyngeal carcinoma (MONDO:0015459)

## Full-text entities

- **Diseases:** death (MESH:D003643), hypertension (MESH:D006973), Distant metastasis (MESH:D009362), NPC (MESH:D000077274)
- **Species:** Homo sapiens (human, species) [taxon 9606], human gammaherpesvirus 4 (Epstein Barr virus, no rank) [taxon 10376]

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/PMC12176861/full.md

## Figures

5 figures with captions in the complete paper: https://tomesphere.com/paper/PMC12176861/full.md

## References

34 references — full list in the complete paper: https://tomesphere.com/paper/PMC12176861/full.md

---
Source: https://tomesphere.com/paper/PMC12176861