Rethinking Boundary Detection in Deep Learning-Based Medical Image Segmentation
Yi Lin, Dong Zhang, Xiao Fang, Yufan Chen, Kwang-Ting Cheng, Hao Chen

TL;DR
This paper introduces CTO, a novel deep learning architecture combining CNNs, ViT, and edge detection to improve boundary segmentation accuracy in medical images, outperforming existing methods across multiple datasets.
Contribution
The study presents CTO, a new network architecture that integrates local and long-range features with explicit boundary guidance, enhancing boundary segmentation in medical images without extra data or labels.
Findings
Achieves state-of-the-art accuracy on seven medical image datasets.
Balances segmentation accuracy and computational efficiency effectively.
Uses explicit boundary guidance to improve boundary area learning.
Abstract
Medical image segmentation is a pivotal task within the realms of medical image analysis and computer vision. While current methods have shown promise in accurately segmenting major regions of interest, the precise segmentation of boundary areas remains challenging. In this study, we propose a novel network architecture named CTO, which combines Convolutional Neural Networks (CNNs), Vision Transformer (ViT) models, and explicit edge detection operators to tackle this challenge. CTO surpasses existing methods in terms of segmentation accuracy and strikes a better balance between accuracy and efficiency, without the need for additional data inputs or label injections. Specifically, CTO adheres to the canonical encoder-decoder network paradigm, with a dual-stream encoder network comprising a mainstream CNN stream for capturing local features and an auxiliary StitchViT stream for…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Medical Image Segmentation Techniques · Medical Imaging and Analysis
MethodsLinear Layer · Multi-Head Attention · Dense Connections · Adam · Attention Is All You Need · Dropout · Vision Transformer · Layer Normalization · Position-Wise Feed-Forward Layer · Byte Pair Encoding
