Image Corruption-Inspired Membership Inference Attacks against Large Vision-Language Models

Zongyu Wu; Minhua Lin; Zhiwei Zhang; Fali Wang; Xianren Zhang; Xiang Zhang; Suhang Wang

arXiv:2506.12340·cs.CV·February 5, 2026

Image Corruption-Inspired Membership Inference Attacks against Large Vision-Language Models

Zongyu Wu, Minhua Lin, Zhiwei Zhang, Fali Wang, Xianren Zhang, Xiang Zhang, Suhang Wang

PDF

Open Access 1 Video

TL;DR

This paper introduces a novel membership inference attack method for large vision-language models, leveraging their different responses to image corruption to determine if an image was part of the training data.

Contribution

The paper proposes ICIMIA, a simple yet effective attack method inspired by image corruption sensitivity, applicable in both white-box and black-box scenarios for LVLMs.

Findings

01

ICIMIA achieves high accuracy in membership inference.

02

The method is effective under both white-box and black-box settings.

03

Experiments validate the attack's effectiveness on multiple datasets.

Abstract

Large vision-language models (LVLMs) have demonstrated outstanding performance in many downstream tasks. However, LVLMs are trained on large-scale datasets, which can pose privacy risks if training images contain sensitive information. Therefore, it is important to detect whether an image is used to train the LVLM. Recent studies have investigated membership inference attacks (MIAs) against LVLMs, including detecting image-text pairs and single-modality content. In this work, we focus on detecting whether a target image is used to train the target LVLM. We design simple yet effective Image Corruption-Inspired Membership Inference Attacks (ICIMIA) against LVLMs, which are inspired by LVLM's different sensitivity to image corruption for member and non-member images. We first perform an MIA method under the white-box setting, where we can obtain the embeddings of the image through the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

Image Corruption-Inspired Membership Inference Attacks against Large Vision-Language Models· underline

Taxonomy

TopicsAdversarial Robustness in Machine Learning · Artificial Intelligence in Healthcare and Education · COVID-19 diagnosis using AI

MethodsFocus