TransLK-Net: Entangling Transformer and Large Kernel for Progressive and Collaborative Feature Encoding and Decoding in Medical Image Segmentation

Jin Yang; Daniel S.Marcus; and Aristeidis Sotiras

arXiv:2511.17873·eess.IV·November 25, 2025

TransLK-Net: Entangling Transformer and Large Kernel for Progressive and Collaborative Feature Encoding and Decoding in Medical Image Segmentation

Jin Yang, Daniel S.Marcus, and Aristeidis Sotiras

PDF

Open Access

TL;DR

TransLK-Net introduces a novel encoder-decoder architecture combining transformer and large kernel convolutions with attention mechanisms for improved medical image segmentation, addressing limitations of CNNs and ViTs.

Contribution

It proposes PTLK and CTLK modules that integrate multi-scale local features and global information efficiently, along with an Attention Entanglement mechanism for progressive feature enhancement.

Findings

01

Enhanced segmentation accuracy on medical images

02

Effective multi-scale feature capture and global context modeling

03

Reduced computational complexity compared to traditional self-attention

Abstract

Convolutional neural networks (CNNs) and vision transformers (ViTs) are widely employed for medical image segmentation, but they are still challenged by their intrinsic characteristics. CNNs are limited from capturing varying-scaled features and global contextual information due to the employment of fixed-sized kernels. In contrast, ViTs employ self-attention and MLP for global information modeling, but they lack mechanisms to learn spatial-wise local information. Additionally, self-attention leads the network to show high computational complexity. To tackle these limitations, we propose Progressively Entangled Transformer Large Kernel (PTLK) and Collaboratively Entangled Transformer Large Kernel (CTLK) modules to leverage the benefits of self-attention and large kernel convolutions and overcome shortcomings. Specifically, PTLK and CTLK modules employ the Multi-head Large Kernel to…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Neural Network Applications · Generative Adversarial Networks and Image Synthesis · Face recognition and analysis