MISSFormer: An Effective Medical Image Segmentation Transformer

Xiaohong Huang; Zhifang Deng; Dandan Li; Xueguang Yuan

arXiv:2109.07162·cs.CV·December 21, 2021·170 cites

MISSFormer: An Effective Medical Image Segmentation Transformer

Xiaohong Huang, Zhifang Deng, Dandan Li, Xueguang Yuan

PDF

Open Access 1 Repo

TL;DR

MISSFormer is a hierarchical transformer-based model for medical image segmentation that effectively captures both long-range dependencies and local context, outperforming existing methods even when trained from scratch.

Contribution

The paper introduces MISSFormer, a novel hierarchical transformer architecture with enhanced blocks and context bridges for improved medical image segmentation.

Findings

01

Outperforms state-of-the-art methods on multi-organ and cardiac segmentation.

02

Effective even when trained from scratch without pre-training.

03

Demonstrates robustness and generalizability to other segmentation tasks.

Abstract

The CNN-based methods have achieved impressive results in medical image segmentation, but they failed to capture the long-range dependencies due to the inherent locality of the convolution operation. Transformer-based methods are recently popular in vision tasks because of their capacity for long-range dependencies and promising performance. However, it lacks in modeling local context. In this paper, taking medical image segmentation as an example, we present MISSFormer, an effective and powerful Medical Image Segmentation tranSFormer. MISSFormer is a hierarchical encoder-decoder network with two appealing designs: 1) A feed-forward network is redesigned with the proposed Enhanced Transformer Block, which enhances the long-range dependencies and supplements the local context, making the feature more discriminative. 2) We proposed Enhanced Transformer Context Bridge, different from…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

zhifangdeng/missformer
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Neural Network Applications · COVID-19 diagnosis using AI · Radiomics and Machine Learning in Medical Imaging

MethodsAttention Is All You Need · Linear Layer · Absolute Position Encodings · Position-Wise Feed-Forward Layer · Layer Normalization · Label Smoothing · Adam · Residual Connection · Multi-Head Attention · Softmax