TransBTS: Multimodal Brain Tumor Segmentation Using Transformer
Wenxuan Wang, Chen Chen, Meng Ding, Jiangyun Li, Hong Yu, Sen Zha

TL;DR
TransBTS introduces a novel Transformer-based network for 3D MRI brain tumor segmentation, combining local 3D CNN features with global Transformer modeling to improve segmentation accuracy.
Contribution
This work is the first to integrate Transformer with 3D CNNs for brain tumor segmentation in MRI, enhancing global context understanding.
Findings
Achieves state-of-the-art results on BraTS 2019 and 2020 datasets.
Outperforms previous 3D segmentation methods.
Demonstrates the effectiveness of Transformer in medical image segmentation.
Abstract
Transformer, which can benefit from global (long-range) information modeling using self-attention mechanisms, has been successful in natural language processing and 2D image classification recently. However, both local and global features are crucial for dense prediction tasks, especially for 3D medical image segmentation. In this paper, we for the first time exploit Transformer in 3D CNN for MRI Brain Tumor Segmentation and propose a novel network named TransBTS based on the encoder-decoder structure. To capture the local 3D context information, the encoder first utilizes 3D CNN to extract the volumetric spatial feature maps. Meanwhile, the feature maps are reformed elaborately for tokens that are fed into Transformer for global feature modeling. The decoder leverages the features embedded by Transformer and performs progressive upsampling to predict the detailed segmentation map.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Brain Tumor Detection and Classification · Medical Image Segmentation Techniques
Methods3 Dimensional Convolutional Neural Network · Linear Layer · Absolute Position Encodings · Position-Wise Feed-Forward Layer · Attention Is All You Need · Layer Normalization · Dropout · Adam · Residual Connection · Byte Pair Encoding
