MrM: Black-Box Membership Inference Attacks against Multimodal RAG Systems
Peiru Yang, Jinhua Yin, Haoran Zheng, Xueying Bai, Huili Wang, Yufei Sun, Xintian Li, Shangguang Wang, Yongfeng Huang, Tao Qi

TL;DR
This paper introduces MrM, a novel black-box membership inference attack targeting multimodal RAG systems, revealing privacy vulnerabilities in visual-language models through a perturbation-based approach.
Contribution
It presents the first black-box MIA framework for multimodal RAG systems, combining data perturbation and counterfactual strategies to effectively infer membership information.
Findings
MrM achieves high attack success rates across multiple datasets and models.
The method remains effective under adaptive defenses.
It demonstrates significant privacy risks in multimodal RAG systems.
Abstract
Multimodal retrieval-augmented generation (RAG) systems enhance large vision-language models by integrating cross-modal knowledge, enabling their increasing adoption across real-world multimodal tasks. These knowledge databases may contain sensitive information that requires privacy protection. However, multimodal RAG systems inherently grant external users indirect access to such data, making them potentially vulnerable to privacy attacks, particularly membership inference attacks (MIAs). % Existing MIA methods targeting RAG systems predominantly focus on the textual modality, while the visual modality remains relatively underexplored. To bridge this gap, we propose MrM, the first black-box MIA framework targeted at multimodal RAG systems. It utilizes a multi-object data perturbation framework constrained by counterfactual attacks, which can concurrently induce the RAG systems to…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Taxonomy
TopicsMultimodal Machine Learning Applications · Topic Modeling · Adversarial Robustness in Machine Learning
