Practical Membership Inference Attacks Against Large-Scale Multi-Modal Models: A Pilot Study
Myeongseob Ko, Ming Jin, Chenguang Wang, and Ruoxi Jia

TL;DR
This paper explores practical membership inference attacks on large-scale multi-modal models like CLIP, introducing simple and enhanced strategies that significantly improve attack success rates, highlighting privacy vulnerabilities.
Contribution
It proposes a baseline cosine similarity thresholding method, enhances it with transformations, and introduces a weakly supervised attack leveraging non-member data, demonstrating effective privacy attacks on large models.
Findings
Baseline attack achieves over 75% accuracy.
Enhanced attacks outperform baseline across models and datasets.
Weakly supervised attack improves performance by 17% on average.
Abstract
Membership inference attacks (MIAs) aim to infer whether a data point has been used to train a machine learning model. These attacks can be employed to identify potential privacy vulnerabilities and detect unauthorized use of personal data. While MIAs have been traditionally studied for simple classification models, recent advancements in multi-modal pre-training, such as CLIP, have demonstrated remarkable zero-shot performance across a range of computer vision tasks. However, the sheer scale of data and models presents significant computational challenges for performing the attacks. This paper takes a first step towards developing practical MIAs against large-scale multi-modal models. We introduce a simple baseline strategy by thresholding the cosine similarity between text and image features of a target point and propose further enhancing the baseline by aggregating cosine…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Practical Membership Inference Attacks Against Large-Scale Multi-Modal Models: A Pilot Study· youtube
Taxonomy
TopicsAdversarial Robustness in Machine Learning · Anomaly Detection Techniques and Applications
MethodsContrastive Language-Image Pre-training
