SegKAN: High-Resolution Medical Image Segmentation with Long-Distance Dependencies
Shengbo Tan, Rundong Xue, Shipeng Luo, Zeyu Zhang, Xinran Wang, Lei, Zhang, Daji Ergu, Zhang Yi, Yang Zhao, Ying Cai

TL;DR
SegKAN is a novel model for high-resolution medical image segmentation that enhances vessel integrity by improving embedding modules and capturing long-distance dependencies, leading to better segmentation accuracy.
Contribution
The paper introduces a new convolutional structure and a method to transform spatial relationships into temporal ones to improve vessel segmentation in noisy CT images.
Findings
Dice score improved by 1.78% over state-of-the-art
Effective noise smoothing and gradient stability in embeddings
Enhanced capture of long-distance dependencies in segmentation
Abstract
Hepatic vessels in computed tomography scans often suffer from image fragmentation and noise interference, making it difficult to maintain vessel integrity and posing significant challenges for vessel segmentation. To address this issue, we propose an innovative model: SegKAN. First, we improve the conventional embedding module by adopting a novel convolutional network structure for image embedding, which smooths out image noise and prevents issues such as gradient explosion in subsequent stages. Next, we transform the spatial relationships between Patch blocks into temporal relationships to solve the problem of capturing positional relationships between Patch blocks in traditional Vision Transformer models. We conducted experiments on a Hepatic vessel dataset, and compared to the existing state-of-the-art model, the Dice score improved by 1.78%. These results demonstrate that the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMedical Image Segmentation Techniques · Brain Tumor Detection and Classification · Radiomics and Machine Learning in Medical Imaging
MethodsByte Pair Encoding · Linear Layer · Absolute Position Encodings · Dropout · Softmax · Attention Is All You Need · Dense Connections · Residual Connection · Vision Transformer · Multi-Head Attention
