Membership Inference Attacks Against Vision-Language Models

Yuke Hu; Zheng Li; Zhihao Liu; Yang Zhang; Zhan Qin; Kui Ren; Chun; Chen

arXiv:2501.18624·cs.CR·February 10, 2025

Membership Inference Attacks Against Vision-Language Models

Yuke Hu, Zheng Li, Zhihao Liu, Yang Zhang, Zhan Qin, Kui Ren, Chun, Chen

PDF

Open Access 1 Repo

TL;DR

This paper investigates the privacy risks of vision-language models by developing novel membership inference attacks that can accurately detect whether specific data was used in training, highlighting potential data leakage issues.

Contribution

It introduces four new membership inference methods tailored for vision-language models, addressing limitations of existing techniques and focusing on sensitive instruction tuning data.

Findings

01

Achieves over 0.8 AUC in membership inference on small sample sets

02

Demonstrates vulnerability of VLMs to membership inference attacks

03

Provides a comprehensive evaluation of attack effectiveness

Abstract

Vision-Language Models (VLMs), built on pre-trained vision encoders and large language models (LLMs), have shown exceptional multi-modal understanding and dialog capabilities, positioning them as catalysts for the next technological revolution. However, while most VLM research focuses on enhancing multi-modal interaction, the risks of data misuse and leakage have been largely unexplored. This prompts the need for a comprehensive investigation of such risks in VLMs. In this paper, we conduct the first analysis of misuse and leakage detection in VLMs through the lens of membership inference attack (MIA). In specific, we focus on the instruction tuning data of VLMs, which is more likely to contain sensitive or unauthorized information. To address the limitation of existing MIA methods, we introduce a novel approach that infers membership based on a set of samples and their sensitivity to…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

yukehu/vlm_mia
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsSemantic Web and Ontologies · Topic Modeling

MethodsSparse Evolutionary Training · Focus