Comparative Study of UNet-based Architectures for Liver Tumor Segmentation in Multi-Phase Contrast-Enhanced Computed Tomography
Doan-Van-Anh Ly (1), Thanh-Hai Le (1), Thi-Thu-Hien Pham (2, 3) ((1) The Saigon International University, (2) International University, (3) Vietnam National University HCMC)

TL;DR
This study compares various UNet-based architectures for liver tumor segmentation in multi-phase contrast-enhanced CT, finding ResNet-based models with attention modules to be most effective and stable, despite the theoretical advantages of newer models.
Contribution
It introduces attention mechanisms into ResNet-based UNet architectures and demonstrates their effectiveness and stability in liver tumor segmentation.
Findings
ResNet-based models outperform Transformer and State-space models in sample efficiency.
Adding CBAM attention improves segmentation performance and boundary delineation.
ResNetUNet3+ with CBAM achieves the highest Dice score and lowest boundary error.
Abstract
Segmentation of liver structures in multi-phase contrast-enhanced computed tomography (CECT) plays a crucial role in computer-aided diagnosis and treatment planning. In this study, we investigate the performance of UNet-based architectures for liver tumor segmentation, evaluating ResNet, Transformer-based, and State-space (Mamba) backbones initialized with pretrained weights. Our comparative analysis reveals that despite the theoretical advantages of modern architectures in modeling long-range dependencies, ResNet-based models demonstrated superior sample efficiency on this dataset. This suggests that the inherent inductive biases of Convolutional Neural Networks (CNNs) remain advantageous for generalizing on limited medical data compared to data-hungry alternatives. To further improve segmentation quality, we introduce attention mechanisms into the backbone, finding that the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
