Improving Prostate Gland Segmentation Using Transformer based Architectures

Shatha Abudalou; Jung Choi; Yasin Yilmaz; Yoganand Balagurunathan

arXiv:2506.14844·eess.IV·April 23, 2026

Improving Prostate Gland Segmentation Using Transformer based Architectures

Shatha Abudalou, Jung Choi, Yasin Yilmaz, Yoganand Balagurunathan

PDF

TL;DR

This study evaluates transformer-based models, SwinUNETR and UNETR, for prostate MRI segmentation, demonstrating their robustness and improved accuracy over traditional CNNs, especially in heterogeneous datasets.

Contribution

It introduces the application of SwinUNETR for prostate segmentation, showing superior performance and robustness compared to previous CNN-based methods.

Findings

01

SwinUNETR achieved higher Dice scores than UNETR and baseline CNNs.

02

Transformer models showed reduced label noise sensitivity.

03

Performance improved notably on larger gland size datasets.

Abstract

Inter reader variability and cross site domain shift challenge the automatic segmentation of prostate anatomy using T2 weighted MRI images. This study investigates whether transformer models can retain precision amid such heterogeneity. We compare the performance of UNETR and SwinUNETR in prostate gland segmentation against our previous 3D UNet model [1], based on 546 MRI (T2weighted) volumes annotated by two independent experts. Three training strategies were analyzed: single cohort dataset, 5 fold cross validated mixed cohort, and gland size based dataset. Hyperparameters were tuned by Optuna. The test set, from an independent population of readers, served as the evaluation endpoint (Dice Similarity Coefficient). In single reader training, SwinUNETR achieved an average dice score of 0.816 for Reader#1 and 0.860 for Reader#2, while UNETR scored 0.8 and 0.833 for Readers #1 and #2,…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.