TEC-Net: Vision Transformer Embrace Convolutional Neural Networks for Medical Image Segmentation
Rui Sun, Tao Lei, Weichuan Zhang, Yong Wan, Yong Xia, Asoke K. Nandi

TL;DR
TEC-Net combines deformable convolution and a novel attention module within a hybrid CNN-Transformer architecture to improve medical image segmentation accuracy, efficiency, and adaptability without pre-training.
Contribution
The paper introduces TEC-Net, a novel hybrid architecture that enhances feature extraction and long-range dependency modeling in medical images, outperforming existing methods.
Findings
TEC-Net achieves superior segmentation accuracy compared to state-of-the-art methods.
The model requires fewer parameters and less computation.
TEC-Net does not rely on pre-training, simplifying deployment.
Abstract
The hybrid architecture of convolution neural networks (CNN) and Transformer has been the most popular method for medical image segmentation. However, the existing networks based on the hybrid architecture suffer from two problems. First, although the CNN branch can capture image local features by using convolution operation, the vanilla convolution is unable to achieve adaptive extraction of image features. Second, although the Transformer branch can model the global information of images, the conventional self-attention only focuses on the spatial self-attention of images and ignores the channel and cross-dimensional self-attention leading to low segmentation accuracy for medical images with complex backgrounds. To solve these problems, we propose vision Transformer embrace convolutional neural networks for medical image segmentation (TEC-Net). Our network has two advantages. First,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Radiomics and Machine Learning in Medical Imaging · Medical Imaging and Analysis
MethodsMulti-Head Attention · Attention Is All You Need · Dropout · Residual Connection · Linear Layer · Label Smoothing · Adam · Absolute Position Encodings · Position-Wise Feed-Forward Layer · Layer Normalization
