Enhancing Medical Image Segmentation with TransCeption: A Multi-Scale Feature Fusion Approach
Reza Azad, Yiwei Jia, Ehsan Khodapanah Aghdam, Julien Cohen-Adad,, Dorit Merhof

TL;DR
TransCeption introduces a pure transformer-based U-Net variant with multi-scale feature fusion and a dual transformer bridge, significantly improving medical image segmentation performance across multiple tasks.
Contribution
It proposes a novel multi-scale feature fusion approach with a dual transformer bridge within a transformer-based U-Net for enhanced medical image segmentation.
Findings
Outperforms previous methods on multi-organ segmentation
Achieves superior results on skin lesion segmentation
Demonstrates effective multi-scale feature extraction and fusion
Abstract
While CNN-based methods have been the cornerstone of medical image segmentation due to their promising performance and robustness, they suffer from limitations in capturing long-range dependencies. Transformer-based approaches are currently prevailing since they enlarge the reception field to model global contextual correlation. To further extract rich representations, some extensions of the U-Net employ multi-scale feature extraction and fusion modules and obtain improved performance. Inspired by this idea, we propose TransCeption for medical image segmentation, a pure transformer-based U-shape network featured by incorporating the inception-like module into the encoder and adopting a contextual bridge for better feature fusion. The design proposed in this work is based on three core principles: (1) The patch merging module in the encoder is redesigned with ResInception Patch Merging…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAI in cancer detection · Radiomics and Machine Learning in Medical Imaging · COVID-19 diagnosis using AI
MethodsMulti-Head Attention · Attention Is All You Need · *Communicated@Fast*How Do I Communicate to Expedia? · Dense Connections · Adam · Position-Wise Feed-Forward Layer · Softmax · Linear Layer · Absolute Position Encodings · Dropout
