CBAM-Enhanced DenseNet121 for Multi-Class Chest X-Ray Classification with Grad-CAM Explainability
Utsho Kumar Dey

TL;DR
This study introduces a CBAM-enhanced DenseNet121 model for multi-class chest X-ray classification, achieving high accuracy and explainability, especially useful in low-resource settings with limited radiologist access.
Contribution
It presents a novel transfer-learning framework integrating CBAM into DenseNet121 for three-class pneumonia detection, with comprehensive baseline comparisons and interpretability analysis.
Findings
CBAM-DenseNet121 achieves 84.29% test accuracy.
Model's Grad-CAM visualizations focus on relevant pulmonary regions.
EfficientNetB3 underperforms compared to a custom CNN baseline.
Abstract
Pneumonia remains a leading cause of childhood mortality worldwide, with a heavy burden in low-resource settings such as Bangladesh where radiologist availability is limited. Most existing deep learning approaches treat pneumonia detection as a binary problem, overlooking the clinically critical distinction between bacterial and viral aetiology. This paper proposes CBAM-DenseNet121, a transfer-learning framework that integrates the Convolutional Block Attention Module (CBAM) into DenseNet121 for three-class chest X-ray classification: Normal, Bacterial Pneumonia, and Viral Pneumonia. We also conduct a systematic binary-task baseline study revealing that EfficientNetB3 (73.88%) underperforms even the custom CNN baseline (78.53%) -- a practically important negative finding for medical imaging model selection. To ensure statistical reliability, all experiments were repeated three times…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
