SegStitch: Multidimensional Transformer for Robust and Efficient Medical Imaging Segmentation
Shengbo Tan, Zeyu Zhang, Ying Cai, Daji Ergu, Lin Wu, Binbin Hu,, Pengzhang Yu, Yang Zhao

TL;DR
SegStitch introduces a transformer-based architecture with denoising ODE blocks for efficient and robust 3D medical image segmentation, improving accuracy and reducing computational costs compared to existing methods.
Contribution
The paper presents SegStitch, a novel transformer architecture with denoising ODE blocks that enhances 3D medical image segmentation efficiency and accuracy.
Findings
Achieved up to 11.48% improvement in mDSC on BTCV dataset.
Reduced parameters by 36.7% and FLOPS by 10.7% compared to UNETR.
Demonstrated robustness and efficiency suitable for clinical application.
Abstract
Medical imaging segmentation plays a significant role in the automatic recognition and analysis of lesions. State-of-the-art methods, particularly those utilizing transformers, have been prominently adopted in 3D semantic segmentation due to their superior performance in scalability and generalizability. However, plain vision transformers encounter challenges due to their neglect of local features and their high computational complexity. To address these challenges, we introduce three key contributions: Firstly, we proposed SegStitch, an innovative architecture that integrates transformers with denoising ODE blocks. Instead of taking whole 3D volumes as inputs, we adapt axial patches and customize patch-wise queries to ensure semantic consistency. Additionally, we conducted extensive experiments on the BTCV and ACDC datasets, achieving improvements up to 11.48% and 6.71% respectively in…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsBrain Tumor Detection and Classification
Methods*Communicated@Fast*How Do I Communicate to Expedia? · Max Pooling · Softmax · Dense Connections · Batch Normalization · Linear Layer · Concatenated Skip Connection · Residual Connection · U-Net · Multi-Head Attention
