DP^2-VL: Private Photo Dataset Protection by Data Poisoning for Vision-Language Models
Hongyi Miao, Jun Jia, Xincheng Wang, Qianli Ma, Wei Sun, Wangqiu Zhou, Dandan Zhu, Yewen Cao, Zhi Liu, Guangtao Zhai

TL;DR
This paper identifies privacy risks in vision-language models where private photos can be exploited to reveal sensitive information, and proposes a data poisoning method called DP2-VL to protect such images effectively.
Contribution
It introduces a new privacy threat model, a benchmark dataset for identity-affiliation leakage, and a novel data poisoning framework, DP2-VL, to safeguard private photos in vision-language models.
Findings
VLMs can recognize identities from small private datasets.
DP2-VL effectively prevents identity leakage across models.
The method is robust to various post-processing and protection ratios.
Abstract
Recent advances in visual-language alignment have endowed vision-language models (VLMs) with fine-grained image understanding capabilities. However, this progress also introduces new privacy risks. This paper first proposes a novel privacy threat model named identity-affiliation learning: an attacker fine-tunes a VLM using only a few private photos of a target individual, thereby embedding associations between the target facial identity and their private property and social relationships into the model's internal representations. Once deployed via public APIs, this model enables unauthorized exposure of the target user's private information upon input of their photos. To benchmark VLMs' susceptibility to such identity-affiliation leakage, we introduce the first identity-affiliation dataset comprising seven typical scenarios appearing in private photos. Each scenario is instantiated with…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdversarial Robustness in Machine Learning · Face recognition and analysis · Multimodal Machine Learning Applications
