# Different BI-RADS breast cancer diagnosis using MobileNetV1 and vision transformer based on explainable artificial intelligence (XAI)

**Authors:** Israa Abdelsabour, Ahmed Elgarayhi, Mohammed Sallah, Mohammed Elmogy

PMC · DOI: 10.1038/s41598-026-37199-2 · Scientific Reports · 2026-02-17

## TL;DR

This paper proposes a new AI system combining MobileNetV1 and Vision Transformer for accurate and interpretable breast cancer diagnosis using mammograms.

## Contribution

A novel dual-stream framework using MobileNetV1 and ViT with XAI for multi-class BI-RADS classification of breast cancer.

## Key findings

- The proposed framework achieved over 99% accuracy, sensitivity, and specificity across all BI-RADS categories.
- XAI techniques like Grad-CAM provided interpretable visual explanations for AI-based diagnoses.
- The model outperformed existing CNN and transformer models in multi-class BC classification.

## Abstract

Breast cancer (BC) remains one of the leading causes of death among women in the world, depending on the requirement for precise, effective, and interpretable computer-aided diagnosis systems (CADs). In this work, a hybrid deep learning (DL) framework is presented for multi-class BI-RADS BC classification using mammographic images. This framework fuses MobileNetV1, a lightweight convolutional neural network (CNN), to capture fine-grained local features and combines it with a Vision Transformer (ViT) to model global contextual connections, thereby enabling corresponding representation learning through a dual-stream structure. The evaluation was performed on the publicly available King Abdulaziz University BC Mammogram Dataset (KAUBC), which includes multi-view mammograms (craniocaudal (CC) and mediolateral oblique (MLO)) arranged according to the BI-RADS classification scheme and characterized by class imbalance. Feature-level fusion is performed, followed by a bagging-based logistic regression (LR) classifier to enhance robustness and decrease prediction conflict. The proposed approach was extensively analyzed using 5-fold cross-validation and compared with multiple state-of-the-art CNN and transformer models, each fused with various machine learning (ML) classifiers. The experimental results demonstrate higher and stable performance across all BI-RADS categories, with accuracy (ACC), sensitivity (SEN), and specificity (SPE) exceeding 99%. In addition, explainable artificial intelligence (XAI) techniques, including Grad-CAM and Grad-CAM++, were applied to provide clinically interpretable visual explanations by highlighting diagnostically relevant regions in mammograms. These results indicate that the proposed MobileNetV1–ViT–Bagging framework recommends an effective, computationally structured, and explainable solution for multi-class BI-RADS BC diagnosis, with strong potential for clinical decision-support applications.

## Linked entities

- **Diseases:** breast cancer (MONDO:0004989)

## Full-text entities

- **Genes:** ERBB2 (erb-b2 receptor tyrosine kinase 2) [NCBI Gene 2064] {aka CD340, HER-2, HER-2/neu, HER2, MLN 19, MLN-19}, VIT (vitrin) [NCBI Gene 5212] {aka VIT1}
- **Diseases:** DL (MESH:D007859), lesion (MESH:D009059), AI (MESH:C538142), XAI (MESH:C538243), fatty (MESH:D008067), Malignant (MESH:D009369), calcifications (MESH:D002114), HI (MESH:D009370), BC (MESH:D001943), TN (MESH:C579935), CLAHE (MESH:D018489), PCA (MESH:C562643), BI-RADS (MESH:D061325), death (MESH:D003643), SEN (MESH:D003807)
- **Chemicals:** CLAHE (-)
- **Species:** Homo sapiens (human, species) [taxon 9606]

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/PMC12920763/full.md

## Figures

8 figures with captions in the complete paper: https://tomesphere.com/paper/PMC12920763/full.md

## References

7 references — full list in the complete paper: https://tomesphere.com/paper/PMC12920763/full.md

---
Source: https://tomesphere.com/paper/PMC12920763