LeViT-UNet: Make Faster Encoders with Transformer for Medical Image Segmentation
Guoping Xu, Xingrong Wu, Xuan Zhang, Xinwei He

TL;DR
LeViT-UNet introduces a fast, efficient transformer-based encoder integrated into U-Net for improved medical image segmentation, balancing accuracy and computational complexity.
Contribution
The paper presents LeViT-UNet, a novel architecture that incorporates LeViT transformers into U-Net, enhancing efficiency and accuracy in medical image segmentation tasks.
Findings
Outperforms competing methods on Synapse and ACDC benchmarks.
Achieves better accuracy with reduced computational cost.
Effectively reuses spatial information through multi-scale skip connections.
Abstract
Medical image segmentation plays an essential role in developing computer-assisted diagnosis and therapy systems, yet still faces many challenges. In the past few years, the popular encoder-decoder architectures based on CNNs (e.g., U-Net) have been successfully applied in the task of medical image segmentation. However, due to the locality of convolution operations, they demonstrate limitations in learning global context and long-range spatial relations. Recently, several researchers try to introduce transformers to both the encoder and decoder components with promising results, but the efficiency requires further improvement due to the high computational complexity of transformers. In this paper, we propose LeViT-UNet, which integrates a LeViT Transformer module into the U-Net architecture, for fast and accurate medical image segmentation. Specifically, we use LeViT as the encoder of…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Medical Image Segmentation Techniques · Radiomics and Machine Learning in Medical Imaging
MethodsMulti-Head Attention · Attention Is All You Need · *Communicated@Fast*How Do I Communicate to Expedia? · Linear Layer · Absolute Position Encodings · Position-Wise Feed-Forward Layer · Label Smoothing · Residual Connection · Concatenated Skip Connection · Dense Connections
