TL;DR
This paper introduces BiTr-Unet, a novel CNN-Transformer hybrid model that significantly improves brain tumor segmentation accuracy on multi-modal MRI scans, leveraging long-range feature extraction capabilities of transformers.
Contribution
The paper presents a new CNN-Transformer combined network specifically designed for brain tumor segmentation, achieving state-of-the-art performance on BraTS2021 datasets.
Findings
Achieved median Dice scores above 0.93 for whole tumor on validation data.
Outperformed existing methods in Hausdorff distance metrics.
Code is publicly available for reproducibility.
Abstract
Convolutional neural networks (CNNs) have achieved remarkable success in automatically segmenting organs or lesions on 3D medical images. Recently, vision transformer networks have exhibited exceptional performance in 2D image classification tasks. Compared with CNNs, transformer networks have an appealing advantage of extracting long-range features due to their self-attention algorithm. Therefore, we propose a CNN-Transformer combined model, called BiTr-Unet, with specific modifications for brain tumor segmentation on multi-modal MRI scans. Our BiTr-Unet achieves good performance on the BraTS2021 validation dataset with median Dice score 0.9335, 0.9304 and 0.8899, and median Hausdorff distance 2.8284, 2.2361 and 1.4142 for the whole tumor, tumor core, and enhancing tumor, respectively. On the BraTS2021 testing dataset, the corresponding results are 0.9257, 0.9350 and 0.8874 for Dice…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
MethodsAttention Is All You Need · Linear Layer · Softmax · Layer Normalization · Multi-Head Attention · Dense Connections · Residual Connection · Vision Transformer
