Medusa: Cross-Modal Transferable Adversarial Attacks on Multimodal Medical Retrieval-Augmented Generation

Yingjia Shang; Yi Liu; Huimin Wang; Furong Li; Wenfang Sun; Wu Chengyu; Yefeng Zheng

arXiv:2511.19257·cs.CR·November 25, 2025

Medusa: Cross-Modal Transferable Adversarial Attacks on Multimodal Medical Retrieval-Augmented Generation

Yingjia Shang, Yi Liu, Huimin Wang, Furong Li, Wenfang Sun, Wu Chengyu, Yefeng Zheng

PDF

Open Access

TL;DR

Medusa introduces a novel black-box adversarial attack framework targeting multimodal medical retrieval-augmented generation systems, exposing significant vulnerabilities and emphasizing the need for robustness in safety-critical medical AI applications.

Contribution

This work presents the first transferable adversarial attack method on MMed-RAG systems using a multi-positive InfoNCE loss and ensemble strategies, advancing security analysis in medical AI.

Findings

01

Achieves over 90% attack success rate in experiments

02

Remains effective against four mainstream defenses

03

Highlights vulnerabilities in medical retrieval-augmented systems

Abstract

With the rapid advancement of retrieval-augmented vision-language models, multimodal medical retrieval-augmented generation (MMed-RAG) systems are increasingly adopted in clinical decision support. These systems enhance medical applications by performing cross-modal retrieval to integrate relevant visual and textual evidence for tasks, e.g., report generation and disease diagnosis. However, their complex architecture also introduces underexplored adversarial vulnerabilities, particularly via visual input perturbations. In this paper, we propose Medusa, a novel framework for crafting cross-modal transferable adversarial attacks on MMed-RAG systems under a black-box setting. Specifically, Medusa formulates the attack as a perturbation optimization problem, leveraging a multi-positive InfoNCE loss (MPIL) to align adversarial visual embeddings with medically plausible but malicious textual…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdversarial Robustness in Machine Learning · Multimodal Machine Learning Applications · Topic Modeling