MISSFormer: An Effective Medical Image Segmentation Transformer
Xiaohong Huang, Zhifang Deng, Dandan Li, Xueguang Yuan

TL;DR
MISSFormer is a hierarchical transformer-based model for medical image segmentation that effectively captures both long-range dependencies and local context, outperforming existing methods even when trained from scratch.
Contribution
The paper introduces MISSFormer, a novel hierarchical transformer architecture with enhanced blocks and context bridges for improved medical image segmentation.
Findings
Outperforms state-of-the-art methods on multi-organ and cardiac segmentation.
Effective even when trained from scratch without pre-training.
Demonstrates robustness and generalizability to other segmentation tasks.
Abstract
The CNN-based methods have achieved impressive results in medical image segmentation, but they failed to capture the long-range dependencies due to the inherent locality of the convolution operation. Transformer-based methods are recently popular in vision tasks because of their capacity for long-range dependencies and promising performance. However, it lacks in modeling local context. In this paper, taking medical image segmentation as an example, we present MISSFormer, an effective and powerful Medical Image Segmentation tranSFormer. MISSFormer is a hierarchical encoder-decoder network with two appealing designs: 1) A feed-forward network is redesigned with the proposed Enhanced Transformer Block, which enhances the long-range dependencies and supplements the local context, making the feature more discriminative. 2) We proposed Enhanced Transformer Context Bridge, different from…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · COVID-19 diagnosis using AI · Radiomics and Machine Learning in Medical Imaging
MethodsAttention Is All You Need · Linear Layer · Absolute Position Encodings · Position-Wise Feed-Forward Layer · Layer Normalization · Label Smoothing · Adam · Residual Connection · Multi-Head Attention · Softmax
