iSegFormer: Interactive Segmentation via Transformers with Application to 3D Knee MR Images
Qin Liu, Zhenlin Xu, Yining Jiao, Marc Niethammer

TL;DR
iSegFormer introduces a memory-efficient transformer model combining Swin transformer and MLP decoder, achieving high performance and efficiency in interactive 3D medical image segmentation.
Contribution
The paper presents a novel transformer architecture, iSegFormer, optimized for memory efficiency and applied to 3D medical image segmentation tasks.
Findings
High computational efficiency in 3D segmentation
Effective combination of hierarchical self-attention and MLP decoding
Improved segmentation accuracy on medical images
Abstract
We propose iSegFormer, a memory-efficient transformer that combines a Swin transformer with a lightweight multilayer perceptron (MLP) decoder. With the efficient Swin transformer blocks for hierarchical self-attention and the simple MLP decoder for aggregating both local and global attention, iSegFormer learns powerful representations while achieving high computational efficiencies. Specifically, we apply iSegFormer to interactive 3D medical image segmentation.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Medical Image Segmentation Techniques · Brain Tumor Detection and Classification
MethodsMulti-Head Attention · Attention Is All You Need · Linear Layer · Softmax · Dense Connections · Stochastic Depth · Residual Connection · Layer Normalization · Swin Transformer
