Breast Ultrasound Tumor Classification Using a Hybrid Multitask CNN-Transformer Network
Bryar Shareef, Min Xian, Aleksandar Vakanski, Haotian Wang

TL;DR
This paper introduces a hybrid CNN-Transformer neural network for breast ultrasound tumor classification and segmentation, effectively capturing both local and global image features to improve accuracy.
Contribution
The study presents a novel hybrid multitask deep neural network combining CNNs and Swin Transformer components for improved BUS tumor classification and segmentation.
Findings
Achieved highest accuracy of 82.7% among compared methods
Attained sensitivity of 86.4%, outperforming others
F1 score of 86.0%, indicating balanced performance
Abstract
Capturing global contextual information plays a critical role in breast ultrasound (BUS) image classification. Although convolutional neural networks (CNNs) have demonstrated reliable performance in tumor classification, they have inherent limitations for modeling global and long-range dependencies due to the localized nature of convolution operations. Vision Transformers have an improved capability of capturing global contextual information but may distort the local image patterns due to the tokenization operations. In this study, we proposed a hybrid multitask deep neural network called Hybrid-MT-ESTAN, designed to perform BUS tumor classification and segmentation using a hybrid architecture composed of CNNs and Swin Transformer components. The proposed approach was compared to nine BUS classification methods and evaluated using seven quantitative metrics on a dataset of 3,320 BUS…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAI in cancer detection · Radiomics and Machine Learning in Medical Imaging · Brain Tumor Detection and Classification
MethodsMulti-Head Attention · Attention Is All You Need · Linear Layer · Residual Connection · Adam · Dense Connections · Label Smoothing · Dropout · Convolution · Absolute Position Encodings
