Medical Image Segmentation Using Directional Window Attention
Daniya Najiha Abdul Kareem, Mustansar Fiaz, Noa Novershtern, Hisham, Cholakkal

TL;DR
DwinFormer is a hierarchical transformer architecture that enhances medical image segmentation by effectively capturing local and global features through directional window attention, outperforming existing methods on multi-organ datasets.
Contribution
The paper introduces DwinFormer, a novel hierarchical encoder-decoder with directional window attention blocks that improve local and global feature extraction for medical image segmentation.
Findings
Outperforms state-of-the-art on Synapse dataset
Effective multi-directional attention captures complex structures
Improves segmentation accuracy in 3D medical images
Abstract
Accurate segmentation of medical images is crucial for diagnostic purposes, including cell segmentation, tumor identification, and organ localization. Traditional convolutional neural network (CNN)-based approaches struggled to achieve precise segmentation results due to their limited receptive fields, particularly in cases involving multi-organ segmentation with varying shapes and sizes. The transformer-based approaches address this limitation by leveraging the global receptive field, but they often face challenges in capturing local information required for pixel-precise segmentation. In this work, we introduce DwinFormer, a hierarchical encoder-decoder architecture for medical image segmentation comprising a directional window (Dwin) attention and global self-attention (GSA) for feature encoding. The focus of our design is the introduction of Dwin block within DwinFormer that…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMedical Image Segmentation Techniques · Brain Tumor Detection and Classification · Image Retrieval and Classification Techniques
MethodsSoftmax · Attention Is All You Need · Focus
