Gated Differential Linear Attention: A Linear-Time Decoder for High-Fidelity Medical Segmentation

Hongbo Zheng; Afshin Bozorgpour; Dorit Merhof; Minjia Zhang

arXiv:2603.02727·cs.CV·May 4, 2026

Gated Differential Linear Attention: A Linear-Time Decoder for High-Fidelity Medical Segmentation

Hongbo Zheng, Afshin Bozorgpour, Dorit Merhof, Minjia Zhang

PDF

1 Repo

TL;DR

This paper introduces a novel gated differential linear attention mechanism for medical image segmentation, achieving state-of-the-art accuracy with efficient linear-time complexity across various imaging modalities.

Contribution

It proposes a new attention module combining differential subtraction and gating, integrated into a transformer-based model for improved boundary preservation and efficiency.

Findings

01

Achieves state-of-the-art results on multiple medical segmentation benchmarks.

02

Maintains omplexity, enabling practical deployment.

03

Outperforms related baselines in accuracy and efficiency.

Abstract

Medical image segmentation requires models that preserve fine anatomical boundaries while remaining practical for clinical deployment. Transformers capture long-range dependencies but incur quadratic attention cost, whereas CNNs are efficient but less effective at global reasoning. Linear attention offers \(\mathcal{O}(N)\) scaling, but often produces diffuse feature aggregation that weakens boundary-sensitive prediction. We introduce a gated differential linear-attention mixer for medical image segmentation. Its global path, Gated Differential Linear Attention (GDLA), performs differential subtraction between two kernelized attention branches over complementary query/key subspaces to suppress redundant responses, and employs a data-dependent gate for token refinement. A parallel local token-mixing branch with depthwise convolution strengthens neighborhood interactions for better…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

xmindflow/gdla
github

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.