GLIMS: Attention-Guided Lightweight Multi-Scale Hybrid Network for   Volumetric Semantic Segmentation

Ziya Ata Yaz{\i}c{\i}; \.Ilkay \"Oks\"uz; Haz{\i}m Kemal Ekenel

arXiv:2404.17854·cs.CV·May 3, 2024

GLIMS: Attention-Guided Lightweight Multi-Scale Hybrid Network for Volumetric Semantic Segmentation

Ziya Ata Yaz{\i}c{\i}, \.Ilkay \"Oks\"uz, Haz{\i}m Kemal Ekenel

PDF

1 Repo

TL;DR

GLIMS is a novel hybrid neural network that combines attention mechanisms, dilated convolutions, and transformer components to achieve efficient and accurate volumetric medical image segmentation with fewer parameters.

Contribution

It introduces a data-efficient hybrid segmentation network with novel modules like DACB and CSAB, improving performance while reducing model complexity.

Findings

01

GLIMS outperforms Swin UNETR on BraTS2021 and BTCV datasets.

02

GLIMS achieves high accuracy with fewer trainable parameters.

03

The model demonstrates effective local-global feature integration.

Abstract

Convolutional Neural Networks (CNNs) have become widely adopted for medical image segmentation tasks, demonstrating promising performance. However, the inherent inductive biases in convolutional architectures limit their ability to model long-range dependencies and spatial correlations. While recent transformer-based architectures address these limitations by leveraging self-attention mechanisms to encode long-range dependencies and learn expressive representations, they often struggle to extract low-level features and are highly dependent on data availability. This motivated us for the development of GLIMS, a data-efficient attention-guided hybrid volumetric segmentation network. GLIMS utilizes Dilated Feature Aggregator Convolutional Blocks (DACB) to capture local-global feature correlations efficiently. Furthermore, the incorporated Swin Transformer-based bottleneck bridges the local…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

yaziciz/GLIMS
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

MethodsAttention Is All You Need · *Communicated@Fast*How Do I Communicate to Expedia? · Linear Layer · 1x1 Convolution · Dense Connections · Max Pooling · Convolution · Concatenated Skip Connection · Batch Normalization · Multi-Head Attention