# Liver Tumor Segmentation with Deep Learning: A Comparative Analysis of CNN-, Transformer-, and YOLO-Based Models on the ATLAS MRI

**Authors:** Büşra Karabağ, Kubilay Ayturan, Fırat Hardalaç

PMC · DOI: 10.3390/diagnostics16050649 · Diagnostics · 2026-02-24

## TL;DR

This paper compares different deep learning models for liver tumor segmentation in MRI scans, finding that 3D CNNs are most accurate, while transformers and YOLO models offer other benefits.

## Contribution

Systematic comparison of CNN, transformer, and YOLO models for liver and tumor segmentation in MRI using the ATLAS dataset.

## Key findings

- 3D nnU-Net achieved highest liver (Dice: 0.946) and tumor (Dice: 0.892) segmentation accuracy.
- Transformers improved boundary delineation and captured global context effectively.
- YOLO-based models offered balanced accuracy with lower computational cost.

## Abstract

Background/Objectives: Hepatocellular carcinoma (HCC) remains a leading cause of cancer-related mortality worldwide, where accurate liver and tumor segmentation from magnetic resonance imaging (MRI) is essential for diagnosis, treatment planning, and disease monitoring. Despite recent advances, MRI-based segmentation remains challenging due to data heterogeneity and limited annotated datasets. This study aims to systematically compare convolutional, transformer-based, and detection-based deep learning approaches for liver and HCC segmentation using contrast-enhanced MRI. Methods: A comprehensive evaluation was conducted on the ATLAS MRI dataset, including 2D- and 3D-CNN, transformer-based architectures, and single-stage YOLO-based segmentation frameworks. All models were trained using consistent preprocessing, patient-level data splits, and standardized evaluation metrics, including Dice coefficient, Intersection over Union (IoU), precision, recall, and F1-score. Results: Volumetric convolutional models achieved the highest segmentation accuracy, with the 3D nnU-Net yielding superior performance for both liver (Dice: 0.946) and tumor (Dice: 0.892) segmentation. Transformer-based models demonstrated competitive results, particularly in capturing global contextual information and improving boundary delineation, while YOLO-based approaches provided balanced accuracy with substantially reduced computational cost. Conclusions: The findings confirm that volumetric CNNs remain the most accurate solution for MRI-based liver and HCC segmentation, whereas transformer- and YOLO-based frameworks offer complementary advantages for specific clinical and resource-constrained scenarios.

## Linked entities

- **Diseases:** Hepatocellular carcinoma (MONDO:0007256), HCC (MONDO:0007256)

## Full-text entities

- **Diseases:** HCC (MESH:D006528), Liver Tumor (MESH:D008113), cancer (MESH:D009369)
- **Species:** Homo sapiens (human, species) [taxon 9606]

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/PMC12984958/full.md

## Figures

12 figures with captions in the complete paper: https://tomesphere.com/paper/PMC12984958/full.md

## References

55 references — full list in the complete paper: https://tomesphere.com/paper/PMC12984958/full.md

---
Source: https://tomesphere.com/paper/PMC12984958