LeViT-UNet: Make Faster Encoders with Transformer for Medical Image   Segmentation

Guoping Xu; Xingrong Wu; Xuan Zhang; Xinwei He

arXiv:2107.08623·cs.CV·July 20, 2021·55 cites

LeViT-UNet: Make Faster Encoders with Transformer for Medical Image Segmentation

Guoping Xu, Xingrong Wu, Xuan Zhang, Xinwei He

PDF

Open Access 2 Repos

TL;DR

LeViT-UNet introduces a fast, efficient transformer-based encoder integrated into U-Net for improved medical image segmentation, balancing accuracy and computational complexity.

Contribution

The paper presents LeViT-UNet, a novel architecture that incorporates LeViT transformers into U-Net, enhancing efficiency and accuracy in medical image segmentation tasks.

Findings

01

Outperforms competing methods on Synapse and ACDC benchmarks.

02

Achieves better accuracy with reduced computational cost.

03

Effectively reuses spatial information through multi-scale skip connections.

Abstract

Medical image segmentation plays an essential role in developing computer-assisted diagnosis and therapy systems, yet still faces many challenges. In the past few years, the popular encoder-decoder architectures based on CNNs (e.g., U-Net) have been successfully applied in the task of medical image segmentation. However, due to the locality of convolution operations, they demonstrate limitations in learning global context and long-range spatial relations. Recently, several researchers try to introduce transformers to both the encoder and decoder components with promising results, but the efficiency requires further improvement due to the high computational complexity of transformers. In this paper, we propose LeViT-UNet, which integrates a LeViT Transformer module into the U-Net architecture, for fast and accurate medical image segmentation. Specifically, we use LeViT as the encoder of…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Neural Network Applications · Medical Image Segmentation Techniques · Radiomics and Machine Learning in Medical Imaging

MethodsMulti-Head Attention · Attention Is All You Need · *Communicated@Fast*How Do I Communicate to Expedia? · Linear Layer · Absolute Position Encodings · Position-Wise Feed-Forward Layer · Label Smoothing · Residual Connection · Concatenated Skip Connection · Dense Connections