Cross-Attention Multimodal Fusion for Breast Cancer Diagnosis: Integrating Mammography and Clinical Data with Explainability

Muhaisin Tiyumba Nantogmah; Abdul-Barik Alhassan; Salamudeen Alhassan

arXiv:2508.16000·eess.IV·August 25, 2025

Cross-Attention Multimodal Fusion for Breast Cancer Diagnosis: Integrating Mammography and Clinical Data with Explainability

Muhaisin Tiyumba Nantogmah, Abdul-Barik Alhassan, Salamudeen Alhassan

PDF

TL;DR

This paper proposes a cross-attention multimodal fusion model that combines mammography images and clinical data to improve breast cancer diagnosis, achieving high accuracy and interpretability on public datasets.

Contribution

It introduces a novel multimodal deep learning approach using cross-attention for integrating imaging and clinical data in breast cancer diagnosis.

Findings

01

Achieved an AUC-ROC of 0.98 on public datasets

02

Improved diagnostic accuracy with multimodal data

03

Enhanced model interpretability through explainability methods

Abstract

A precise assessment of the risk of breast lesions can greatly lower it and assist physicians in choosing the best course of action. To categorise breast lesions, the majority of current computer-aided systems only use characteristics from mammograms. Although this method is practical, it does not completely utilise clinical reports' valuable information to attain the best results. When compared to utilising mammography alone, will clinical features greatly enhance the categorisation of breast lesions? How may clinical features and mammograms be combined most effectively? In what ways may explainable AI approaches improve the interpretability and reliability of models used to diagnose breast cancer? To answer these basic problems, a comprehensive investigation is desperately needed. In order to integrate mammography and categorical clinical characteristics, this study examines a number…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.