Systematic Integration of Attention Modules into CNNs for Accurate and Generalizable Medical Image Diagnosis

Zahid Ullah; Minki Hong; Tahir Mahmood; Jihie Kim

arXiv:2509.05343·cs.CV·September 9, 2025·2 cites

Systematic Integration of Attention Modules into CNNs for Accurate and Generalizable Medical Image Diagnosis

Zahid Ullah, Minki Hong, Tahir Mahmood, Jihie Kim

PDF

Open Access

TL;DR

This paper systematically integrates attention modules into various CNN architectures to improve accuracy, interpretability, and generalization in medical image diagnosis across multiple datasets.

Contribution

It introduces a systematic framework for embedding attention mechanisms into diverse CNNs and evaluates their impact on medical image classification tasks.

Findings

01

Attention-augmented CNNs outperform baseline models in accuracy.

02

Hybrid attention modules yield the best performance, especially with EfficientNetB5.

03

Attention mechanisms improve feature localization and model generalization.

Abstract

Deep learning has become a powerful tool for medical image analysis; however, conventional Convolutional Neural Networks (CNNs) often fail to capture the fine-grained and complex features critical for accurate diagnosis. To address this limitation, we systematically integrate attention mechanisms into five widely adopted CNN architectures, namely, VGG16, ResNet18, InceptionV3, DenseNet121, and EfficientNetB5, to enhance their ability to focus on salient regions and improve discriminative performance. Specifically, each baseline model is augmented with either a Squeeze and Excitation block or a hybrid Convolutional Block Attention Module, allowing adaptive recalibration of channel and spatial feature representations. The proposed models are evaluated on two distinct medical imaging datasets, a brain tumor MRI dataset comprising multiple tumor subtypes, and a Products of Conception…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsBrain Tumor Detection and Classification