Membership Inference Attacks Against Vision-Language Models
Yuke Hu, Zheng Li, Zhihao Liu, Yang Zhang, Zhan Qin, Kui Ren, Chun, Chen

TL;DR
This paper investigates the privacy risks of vision-language models by developing novel membership inference attacks that can accurately detect whether specific data was used in training, highlighting potential data leakage issues.
Contribution
It introduces four new membership inference methods tailored for vision-language models, addressing limitations of existing techniques and focusing on sensitive instruction tuning data.
Findings
Achieves over 0.8 AUC in membership inference on small sample sets
Demonstrates vulnerability of VLMs to membership inference attacks
Provides a comprehensive evaluation of attack effectiveness
Abstract
Vision-Language Models (VLMs), built on pre-trained vision encoders and large language models (LLMs), have shown exceptional multi-modal understanding and dialog capabilities, positioning them as catalysts for the next technological revolution. However, while most VLM research focuses on enhancing multi-modal interaction, the risks of data misuse and leakage have been largely unexplored. This prompts the need for a comprehensive investigation of such risks in VLMs. In this paper, we conduct the first analysis of misuse and leakage detection in VLMs through the lens of membership inference attack (MIA). In specific, we focus on the instruction tuning data of VLMs, which is more likely to contain sensitive or unauthorized information. To address the limitation of existing MIA methods, we introduce a novel approach that infers membership based on a set of samples and their sensitivity to…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSemantic Web and Ontologies · Topic Modeling
MethodsSparse Evolutionary Training · Focus
