Comparative Study of UNet-based Architectures for Liver Tumor Segmentation in Multi-Phase Contrast-Enhanced Computed Tomography

Doan-Van-Anh Ly (1); Thanh-Hai Le (1); Thi-Thu-Hien Pham (2; 3) ((1) The Saigon International University; (2) International University; (3) Vietnam National University HCMC)

arXiv:2510.25522·cs.CV·January 21, 2026

Comparative Study of UNet-based Architectures for Liver Tumor Segmentation in Multi-Phase Contrast-Enhanced Computed Tomography

Doan-Van-Anh Ly (1), Thanh-Hai Le (1), Thi-Thu-Hien Pham (2, 3) ((1) The Saigon International University, (2) International University, (3) Vietnam National University HCMC)

PDF

TL;DR

This study compares various UNet-based architectures for liver tumor segmentation in multi-phase contrast-enhanced CT, finding ResNet-based models with attention modules to be most effective and stable, despite the theoretical advantages of newer models.

Contribution

It introduces attention mechanisms into ResNet-based UNet architectures and demonstrates their effectiveness and stability in liver tumor segmentation.

Findings

01

ResNet-based models outperform Transformer and State-space models in sample efficiency.

02

Adding CBAM attention improves segmentation performance and boundary delineation.

03

ResNetUNet3+ with CBAM achieves the highest Dice score and lowest boundary error.

Abstract

Segmentation of liver structures in multi-phase contrast-enhanced computed tomography (CECT) plays a crucial role in computer-aided diagnosis and treatment planning. In this study, we investigate the performance of UNet-based architectures for liver tumor segmentation, evaluating ResNet, Transformer-based, and State-space (Mamba) backbones initialized with pretrained weights. Our comparative analysis reveals that despite the theoretical advantages of modern architectures in modeling long-range dependencies, ResNet-based models demonstrated superior sample efficiency on this dataset. This suggests that the inherent inductive biases of Convolutional Neural Networks (CNNs) remain advantageous for generalizing on limited medical data compared to data-hungry alternatives. To further improve segmentation quality, we introduce attention mechanisms into the backbone, finding that the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.