Hidden in the Metadata: Stealth Poisoning Attacks on Multimodal Retrieval-Augmented Generation

Kennedy Edemacu; Mohammad Mahdi Shokri

arXiv:2603.00172·cs.CR·March 3, 2026

Hidden in the Metadata: Stealth Poisoning Attacks on Multimodal Retrieval-Augmented Generation

Kennedy Edemacu, Mohammad Mahdi Shokri

PDF

Open Access

TL;DR

This paper reveals a new vulnerability in multimodal retrieval-augmented generation systems where malicious metadata manipulation can significantly influence model outputs without altering visual content.

Contribution

The paper introduces MM-MEPA, a novel multimodal poisoning attack targeting metadata in image-text data, demonstrating its high success rate and exposing a critical security flaw in RAG systems.

Findings

01

MM-MEPA achieves up to 91% attack success rate.

02

Metadata-only manipulation can steer model responses.

03

Existing defenses are largely ineffective against this attack.

Abstract

Retrieval-augmented generation (RAG) has emerged as a powerful paradigm for enhancing multimodal large language models by grounding their responses in external, factual knowledge and thus mitigating hallucinations. However, the integration of externally sourced knowledge bases introduces a critical attack surface. Adversaries can inject malicious multimodal content capable of influencing both retrieval and downstream generation. In this work, we present MM-MEPA, a multimodal poisoning attack that targets the metadata components of image-text entries while leaving the associated visual content unaltered. By only manipulating the metadata, MM-MEPA can still steer multimodal retrieval and induce attacker-desired model responses. We evaluate the attack across multiple benchmark settings and demonstrate its severity. MM-MEPA achieves an attack success rate of up to 91\% consistently…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMultimodal Machine Learning Applications · Topic Modeling · Adversarial Robustness in Machine Learning