TK-Mamba: Marrying KAN With Mamba for Text-Driven 3D Medical Image Segmentation
Haoyu Yang, Yutong Guan, Meixing Shi, Yuxiang Cai, Jintao Chen, Sun Bing, Wenhui Lei, Mianxin Liu, Xiaoming Shi, Yankai Jiang, Jianwei Yin

TL;DR
TK-Mamba introduces a novel hybrid framework combining KAN and Mamba for efficient, accurate 3D medical image segmentation, leveraging multimodal data and advanced nonlinear transformations.
Contribution
It is the first to apply KAN in 3D medical imaging and integrates a dual-branch text-driven strategy for improved segmentation robustness.
Findings
Achieves state-of-the-art accuracy in organ and tumor segmentation.
Demonstrates superior computational efficiency over existing methods.
Effectively models complex 3D structures with novel 3D-GR-KAN.
Abstract
3D medical image segmentation is important for clinical diagnosis and treatment but faces challenges from high-dimensional data and complex spatial dependencies. Traditional single-modality networks, such as CNNs and Transformers, are often limited by computational inefficiency and constrained contextual modeling in 3D settings. To alleviate these limitations, we propose TK-Mamba, a multimodal framework that fuses the linear-time Mamba with Kolmogorov-Arnold Networks (KAN) to form an efficient hybrid backbone. Our approach is characterized by two primary technical contributions. Firstly, we introduce the novel 3D-Group-Rational KAN (3D-GR-KAN), which marks the first application of KAN in 3D medical imaging, providing a superior and computationally efficient nonlinear feature transformation crucial for complex volumetric structures. Secondly, we devise a dual-branch text-driven strategy…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsHandwritten Text Recognition Techniques · Computer Science and Engineering
Methods+ ( 1 ) ⟷ 805 ⟷ ( 330 ) ⟷ 4056|How do I file a complaint with Expedia? · Mamba: Linear-Time Sequence Modeling with Selective State Spaces
