# External Validation of an Open-Source Model for Automated Muscle Segmentation in CT Imaging of Cancer Patients

**Authors:** Hendrik Erenstein, Jona Van den Broeck, Annemieke van der Heij-Meijer, Wim P. Krijnen, Aldo Scafoglieri, Harriët Jager-Wittenaar, Martine Sealy, Peter van Ooijen

PMC · DOI: 10.3390/jimaging12030135 · Journal of Imaging · 2026-03-18

## TL;DR

This study validates an open-source AI model for automatically segmenting muscles in CT scans of cancer patients, showing high accuracy but noting limitations at low BMI.

## Contribution

The study provides external validation of an open-source AI model for muscle segmentation in CT scans using an independent dataset.

## Key findings

- The AI model achieved a median Dice Similarity Coefficient of 0.978 and a median Segmentation Surface Error of 3.863 cm².
- Segmentation errors were more frequent in abdominal wall muscles and at lower BMI values.
- The model showed robustness to variations in arm positioning during scans.

## Abstract

Computed tomography (CT) at the third lumbar vertebra (L3) is widely used for muscle quantification, but manual segmentation is labor intensive. This study externally validates an AI model, trained on a public dataset, for automated L3 muscle segmentation using an independent cohort, including a subgroup analysis of subject characteristics (e.g., age and a history of cancer). The AI model was trained on 900 CT scans with expert annotations from a publicly available repository. Validation was performed on 232 PET CT scans from the University Hospital Brussels, each manually segmented by an expert. Segmentation post-processing employed a density-based clustering algorithm to discard arm muscles and Hounsfield unit (HU) thresholding to refine the muscle segmentation. Performance was assessed using the Dice Similarity Coefficient (DSC) and Segmentation Surface Error (SSE). The model achieved a median DSC of 0.978 and a median SSE of 3.863 cm2 across the validation set. At lower BMI values, the model was more prone to overestimation of muscle surface area. Most segmentation errors occurred in the abdominal wall muscles. Analysis showed no significant difference between arm positioning above the head and alongside the body, indicating robustness to minor artifacts from arm positioning. The AI model delivers accurate, automated L3 muscle segmentation, supporting larger-scale body composition studies. However, diminished accuracy at low BMI values and limited demographic diversity of the data highlight the need for broader validation.

## Linked entities

- **Diseases:** cancer (MONDO:0004992)

## Full-text entities

- **Diseases:** lung (MESH:D008171), HNC (MESH:D006258), Muscle (MESH:D019042), metastases (MESH:D009362), CCI (MESH:C566784), Comorbidity (MESH:D004194), Sarcopenia (MESH:D055948), lung cancer (MESH:D008175), Malnutrition (MESH:D044342), falls (MESH:C537863), injury to (MESH:D014947), esophageal (MESH:D004941), underweight (MESH:D013851), DSC (MESH:C536318), Cancer (MESH:D009369), loss of muscle mass (MESH:C536030), melanoma (MESH:D008545)
- **Chemicals:** HU (-)
- **Species:** Homo sapiens (human, species) [taxon 9606]

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/PMC13028208/full.md

## Figures

1 figure with captions in the complete paper: https://tomesphere.com/paper/PMC13028208/full.md

## References

37 references — full list in the complete paper: https://tomesphere.com/paper/PMC13028208/full.md

---
Source: https://tomesphere.com/paper/PMC13028208