TL;DR
This paper introduces a novel gated differential linear attention mechanism for medical image segmentation, achieving state-of-the-art accuracy with efficient linear-time complexity across various imaging modalities.
Contribution
It proposes a new attention module combining differential subtraction and gating, integrated into a transformer-based model for improved boundary preservation and efficiency.
Findings
Achieves state-of-the-art results on multiple medical segmentation benchmarks.
Maintains omplexity, enabling practical deployment.
Outperforms related baselines in accuracy and efficiency.
Abstract
Medical image segmentation requires models that preserve fine anatomical boundaries while remaining practical for clinical deployment. Transformers capture long-range dependencies but incur quadratic attention cost, whereas CNNs are efficient but less effective at global reasoning. Linear attention offers \(\mathcal{O}(N)\) scaling, but often produces diffuse feature aggregation that weakens boundary-sensitive prediction. We introduce a gated differential linear-attention mixer for medical image segmentation. Its global path, Gated Differential Linear Attention (GDLA), performs differential subtraction between two kernelized attention branches over complementary query/key subspaces to suppress redundant responses, and employs a data-dependent gate for token refinement. A parallel local token-mixing branch with depthwise convolution strengthens neighborhood interactions for better…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
