TransBTSV2: Towards Better and More Efficient Volumetric Segmentation of Medical Images
Jiangyun Li, Wenxuan Wang, Chen Chen, Tianxiang Zhang, Sen Zha, Jing, Wang, Hong Yu

TL;DR
TransBTSV2 introduces a hybrid CNN-Transformer network for efficient and accurate 3D medical image segmentation, outperforming existing methods across multiple datasets without requiring pre-training.
Contribution
The paper presents TransBTSV2, a novel hybrid architecture that combines CNNs and Transformers with a redesigned Transformer block and deformable bottleneck module for improved volumetric segmentation.
Findings
Achieves state-of-the-art results on four medical datasets.
Operates effectively without pre-training.
Provides a general framework for various medical image segmentation tasks.
Abstract
Transformer, benefiting from global (long-range) information modeling using self-attention mechanism, has been successful in natural language processing and computer vision recently. Convolutional Neural Networks, capable of capturing local features, are difficult to model explicit long-distance dependencies from global feature space. However, both local and global features are crucial for dense prediction tasks, especially for 3D medical image segmentation. In this paper, we present the further attempt to exploit Transformer in 3D CNN for 3D medical image volumetric segmentation and propose a novel network named TransBTSV2 based on the encoder-decoder structure. Different from TransBTS, the proposed TransBTSV2 is not limited to brain tumor segmentation (BTS) but focuses on general medical image segmentation, providing a stronger and more efficient 3D baseline for volumetric…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsRadiomics and Machine Learning in Medical Imaging · Advanced Neural Network Applications · Brain Tumor Detection and Classification
MethodsMulti-Head Attention · Attention Is All You Need · 3 Dimensional Convolutional Neural Network · Linear Layer · Residual Connection · Dense Connections · Byte Pair Encoding · Absolute Position Encodings · Softmax · Dropout
