# Automated Lymph Node Localization and Segmentation in Patients with Head and Neck Cancer: Opportunities and Limitations of Using a Generic AI Model

**Authors:** Miriam Rinneburger, Heike Carolus, Andra-Iza Iuga, Mathilda Weisthoff, Simon Lennartz, Nils Große Hokamp, Liliana Lourenco Caldeira, Astha Jaiswal, David Maintz, Fabian Christopher Laqua, Bettina Baeßler, Tobias Klinder, Thorsten Persigehl

PMC · DOI: 10.3390/diagnostics16020355 · 2026-01-21

## TL;DR

This study evaluates a generic AI model for identifying and segmenting lymph nodes in head and neck cancer patients, finding it effective for smaller nodes but less so for larger or necrotic ones.

## Contribution

The study assesses the performance of a pre-trained AI model on head and neck cancer lymph nodes, highlighting its limitations in detecting enlarged or necrotic nodes.

## Key findings

- The AI model achieved an average recall of 0.70 and a Dice score of 0.73 for lymph node segmentation.
- Segmentation accuracy was similar for metastatic and non-metastatic nodes, but localization was worse for metastatic nodes.
- Model performance dropped significantly for enlarged nodes (≥15 mm short-axis diameter), with a recall of 0.36.

## Abstract

Background/Objectives: Accurate assessment of lymph nodes is of paramount importance for correct cN staging in head and neck cancer; however, it is very time-consuming for radiologists, and lymph node metastases of head and neck cancers may show distinct characteristics, such as central necrosis or very large size. Here, we evaluate the performance of a previously developed generic cervical lymph node segmentation model in a cohort of patients with head and neck cancer. Methods: In our retrospective single-center, multi-vendor study, we included 125 patients with head and neck cancer with at least one untreated lymph node metastasis. On the respective cervical CT scan, an experienced radiologist segmented lymph nodes semi-automatically. All 3D segmentations were confirmed by a second reader. These manual segmentations were compared to segmentations generated by an AI model previously trained on a different dataset of varying cancers. Results: In cervical CT scans from 125 patients (61.9 years ± 10.6, 100 men), 3656 lymph nodes were segmented as ground-truth, including 544 clinical metastases. The AI achieved an average recall of 0.70 with 6.5 false positives per CT scan. The average global Dice accounts for 0.73 per scan, with an average Hausdorff distance of 0.88 mm. When analyzing the individual nodes, segmentation accuracy was similar for non-metastatic and metastatic lymph nodes, with a sensitivity of 0.89 and 0.85. Localization performance was lower for metastatic than for non-metastatic lymph nodes, with a recall of 0.65 and 0.74, respectively. Model performance was worse for enlarged nodes (short-axis diameter ≥ 15 mm), with a recall of 0.36 and a sensitivity of 0.67. Conclusions: The AI model for generic cervical lymph node segmentation shows good performance for smaller nodes (SAD ≤ 15 mm) with respect to localization and segmentation accuracy. However, for clearly enlarged and necrotic nodes, a retraining of the generic AI algorithm seems to be required for accurate cN staging.

## Linked entities

- **Diseases:** head and neck cancer (MONDO:0005627)

## Full-text entities

- **Diseases:** cancers (MESH:D009369), Head and Neck Cancer (MESH:D006258), lymph node metastases (MESH:D008207), necrosis (MESH:D009336), metastases (MESH:D009362)
- **Species:** Homo sapiens (human, species) [taxon 9606]

## Figures

6 figures with captions in the complete paper: https://tomesphere.com/paper/PMC12840324/full.md

---
Source: https://tomesphere.com/paper/PMC12840324