Harnessing Lightweight Transformer with Contextual Synergic Enhancement for Efficient 3D Medical Image Segmentation
Xinyu Liu, Zhen Chen, Wuyang Li, Chenxin Li, Yixuan Yuan

TL;DR
This paper introduces Light-UNETR, a lightweight transformer model with novel modules and a learning strategy that significantly improves 3D medical image segmentation efficiency and performance, especially with limited labeled data.
Contribution
We propose Light-UNETR, featuring LIDR and CGLU modules, along with CSE learning to enhance model and data efficiency in 3D medical image segmentation.
Findings
Outperforms existing methods with only 10% labeled data.
Reduces FLOPs by 90.8% and parameters by 85.8%.
Achieves 1.43% higher Jaccard score on Left Atrial Segmentation.
Abstract
Transformers have shown remarkable performance in 3D medical image segmentation, but their high computational requirements and need for large amounts of labeled data limit their applicability. To address these challenges, we consider two crucial aspects: model efficiency and data efficiency. Specifically, we propose Light-UNETR, a lightweight transformer designed to achieve model efficiency. Light-UNETR features a Lightweight Dimension Reductive Attention (LIDR) module, which reduces spatial and channel dimensions while capturing both global and local features via multi-branch attention. Additionally, we introduce a Compact Gated Linear Unit (CGLU) to selectively control channel interaction with minimal parameters. Furthermore, we introduce a Contextual Synergic Enhancement (CSE) learning strategy, which aims to boost the data efficiency of Transformers. It first leverages the extrinsic…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Multimodal Machine Learning Applications · Generative Adversarial Networks and Image Synthesis
