How to make Medical AI Systems safer? Simulating Vulnerabilities, and Threats in Multimodal Medical RAG System

Kaiwen Zuo; Zelin Liu; Raman Dutt; Ziyang Wang; Zhongtian Sun; Fan Mo; Pietro Li\`o

arXiv:2508.17215·cs.LG·January 6, 2026

How to make Medical AI Systems safer? Simulating Vulnerabilities, and Threats in Multimodal Medical RAG System

Kaiwen Zuo, Zelin Liu, Raman Dutt, Ziyang Wang, Zhongtian Sun, Fan Mo, Pietro Li\`o

PDF

TL;DR

This paper introduces MedThreatRAG, a framework for systematically testing vulnerabilities in medical multimodal RAG systems by injecting adversarial image-text pairs, revealing significant security gaps and guiding safer system design.

Contribution

We propose MedThreatRAG, a novel poisoning framework with Cross-Modal Conflict Injection to expose vulnerabilities in medical RAG systems under realistic semi-open attack scenarios.

Findings

01

MedThreatRAG significantly reduces answer accuracy in tested systems.

02

Cross-Modal Conflict Injection causes severe degradation in retrieval and generation.

03

Our results highlight critical security gaps in current clinical RAG systems.

Abstract

Large Vision-Language Models (LVLMs) augmented with Retrieval-Augmented Generation (RAG) are increasingly employed in medical AI to enhance factual grounding through external clinical image-text retrieval. However, this reliance creates a significant attack surface. We propose MedThreatRAG, a novel multimodal poisoning framework that systematically probes vulnerabilities in medical RAG systems by injecting adversarial image-text pairs. A key innovation of our approach is the construction of a simulated semi-open attack environment, mimicking real-world medical systems that permit periodic knowledge base updates via user or pipeline contributions. Within this setting, we introduce and emphasize Cross-Modal Conflict Injection (CMCI), which embeds subtle semantic contradictions between medical images and their paired reports. These mismatches degrade retrieval and generation by disrupting…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.