Systematic Integration of Attention Modules into CNNs for Accurate and Generalizable Medical Image Diagnosis
Zahid Ullah, Minki Hong, Tahir Mahmood, Jihie Kim

TL;DR
This paper systematically integrates attention modules into various CNN architectures to improve accuracy, interpretability, and generalization in medical image diagnosis across multiple datasets.
Contribution
It introduces a systematic framework for embedding attention mechanisms into diverse CNNs and evaluates their impact on medical image classification tasks.
Findings
Attention-augmented CNNs outperform baseline models in accuracy.
Hybrid attention modules yield the best performance, especially with EfficientNetB5.
Attention mechanisms improve feature localization and model generalization.
Abstract
Deep learning has become a powerful tool for medical image analysis; however, conventional Convolutional Neural Networks (CNNs) often fail to capture the fine-grained and complex features critical for accurate diagnosis. To address this limitation, we systematically integrate attention mechanisms into five widely adopted CNN architectures, namely, VGG16, ResNet18, InceptionV3, DenseNet121, and EfficientNetB5, to enhance their ability to focus on salient regions and improve discriminative performance. Specifically, each baseline model is augmented with either a Squeeze and Excitation block or a hybrid Convolutional Block Attention Module, allowing adaptive recalibration of channel and spatial feature representations. The proposed models are evaluated on two distinct medical imaging datasets, a brain tumor MRI dataset comprising multiple tumor subtypes, and a Products of Conception…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsBrain Tumor Detection and Classification
