Practical Membership Inference Attacks Against Large-Scale Multi-Modal   Models: A Pilot Study

Myeongseob Ko; Ming Jin; Chenguang Wang; and Ruoxi Jia

arXiv:2310.00108·cs.LG·October 3, 2023·1 cites

Practical Membership Inference Attacks Against Large-Scale Multi-Modal Models: A Pilot Study

Myeongseob Ko, Ming Jin, Chenguang Wang, and Ruoxi Jia

PDF

Open Access 1 Repo 1 Video

TL;DR

This paper explores practical membership inference attacks on large-scale multi-modal models like CLIP, introducing simple and enhanced strategies that significantly improve attack success rates, highlighting privacy vulnerabilities.

Contribution

It proposes a baseline cosine similarity thresholding method, enhances it with transformations, and introduces a weakly supervised attack leveraging non-member data, demonstrating effective privacy attacks on large models.

Findings

01

Baseline attack achieves over 75% accuracy.

02

Enhanced attacks outperform baseline across models and datasets.

03

Weakly supervised attack improves performance by 17% on average.

Abstract

Membership inference attacks (MIAs) aim to infer whether a data point has been used to train a machine learning model. These attacks can be employed to identify potential privacy vulnerabilities and detect unauthorized use of personal data. While MIAs have been traditionally studied for simple classification models, recent advancements in multi-modal pre-training, such as CLIP, have demonstrated remarkable zero-shot performance across a range of computer vision tasks. However, the sheer scale of data and models presents significant computational challenges for performing the attacks. This paper takes a first step towards developing practical MIAs against large-scale multi-modal models. We introduce a simple baseline strategy by thresholding the cosine similarity between text and image features of a target point and propose further enhancing the baseline by aggregating cosine…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

ruoxi-jia-group/clip-mia
pytorchOfficial

Videos

Practical Membership Inference Attacks Against Large-Scale Multi-Modal Models: A Pilot Study· youtube

Taxonomy

TopicsAdversarial Robustness in Machine Learning · Anomaly Detection Techniques and Applications

MethodsContrastive Language-Image Pre-training