# CT-based multiple instance and ensemble learning for lymph node metastasis prediction in esophageal squamous cell carcinoma: a multicentre, retrospective study

**Authors:** Kai Qin, Jianchao Lu, Na Li, Jie Zhu, Lin Peng, Jiayu Xiang, Yi Jin, Pei Yang, Junqiang Chen, Qifeng Wang

PMC · DOI: 10.1186/s40644-026-01005-z · Cancer Imaging · 2026-02-14

## TL;DR

This study developed a CT-based model to predict lymph node metastasis in esophageal cancer patients, helping identify those who may benefit from additional treatment.

## Contribution

A novel Stacking model using multiple instance and ensemble learning improves lymph node metastasis prediction in ESCC.

## Key findings

- The Stacking model outperformed traditional models with AUC values of 0.883 in training and 0.819 in external validation.
- High-risk pN0 patients identified by the model had worse survival and benefited from postoperative adjuvant therapy.
- Low-risk pN0 patients did not show significant survival improvement with postoperative adjuvant therapy.

## Abstract

Accurately predicting lymph node metastasis (LNM) in esophageal squamous cell carcinoma (ESCC) is crucial for planning patient treatments. However, this task remains challenging, complicating treatment decision-making; this is particularly concerning for patients classified as pN0 owing to insufficient lymph node dissection (< 15 lymph nodes), as the effectiveness of postoperative adjuvant therapy (POAT) for these patients remains controversial. Therefore, we aimed to develop a CT-based predictive model to improve LNM detection in ESCC patients and identify pN0 patients who can benefit from POAT.

We retrospectively enrolled 974 ESCC patients who underwent radical esophagectomy with adequate lymph node dissection (≥ 15 lymph nodes), dividing them into training (432 patients), internal validation (185 patients), and external validation cohorts (357 patients). To predict LNM, we developed a Stacking model using multiple instance and ensemble learning, leveraging the visible lymph nodes, primary tumor features, and clinical characteristics of each patient. Additionally, we separately enrolled 386 pN0 patients who underwent insufficient lymph node dissection and classified them into low-risk and high-risk groups on the basis of the optimal cut-off value from the Stacking model. Kaplan‒Meier and Cox models were used to assess the impact of POAT on patient survival.

The Stacking model achieved area under the curve (AUC) values of 0.883, 0.834, and 0.819 in the training, internal validation, and external validation cohorts, respectively, outperforming traditional models based separately on tumor features, clinical characteristics, or the features of the largest lymph node. pN0 patients identified by the Stacking model as high risk had worse overall survival than low-risk patients did. POAT provided survival benefits for the high-risk patients but had no significant effect on the survival of low-risk patients.

Our Stacking model achieved excellent LNM prediction in ESCC patients and holds promise for guiding personalized treatment strategies, particularly for pN0 patients with insufficient lymph node dissection.

The online version contains supplementary material available at 10.1186/s40644-026-01005-z.

## Linked entities

- **Diseases:** esophageal squamous cell carcinoma (MONDO:0005580)

## Full-text entities

- **Diseases:** lymph node metastasis (MESH:D008207), esophageal squamous cell carcinoma (MESH:D000077277)

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/PMC13011690/full.md

## Figures

4 figures with captions in the complete paper: https://tomesphere.com/paper/PMC13011690/full.md

## References

2 references — full list in the complete paper: https://tomesphere.com/paper/PMC13011690/full.md

---
Source: https://tomesphere.com/paper/PMC13011690