TL;DR
GLIMS is a novel hybrid neural network that combines attention mechanisms, dilated convolutions, and transformer components to achieve efficient and accurate volumetric medical image segmentation with fewer parameters.
Contribution
It introduces a data-efficient hybrid segmentation network with novel modules like DACB and CSAB, improving performance while reducing model complexity.
Findings
GLIMS outperforms Swin UNETR on BraTS2021 and BTCV datasets.
GLIMS achieves high accuracy with fewer trainable parameters.
The model demonstrates effective local-global feature integration.
Abstract
Convolutional Neural Networks (CNNs) have become widely adopted for medical image segmentation tasks, demonstrating promising performance. However, the inherent inductive biases in convolutional architectures limit their ability to model long-range dependencies and spatial correlations. While recent transformer-based architectures address these limitations by leveraging self-attention mechanisms to encode long-range dependencies and learn expressive representations, they often struggle to extract low-level features and are highly dependent on data availability. This motivated us for the development of GLIMS, a data-efficient attention-guided hybrid volumetric segmentation network. GLIMS utilizes Dilated Feature Aggregator Convolutional Blocks (DACB) to capture local-global feature correlations efficiently. Furthermore, the incorporated Swin Transformer-based bottleneck bridges the local…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
MethodsAttention Is All You Need · *Communicated@Fast*How Do I Communicate to Expedia? · Linear Layer · 1x1 Convolution · Dense Connections · Max Pooling · Convolution · Concatenated Skip Connection · Batch Normalization · Multi-Head Attention
