DP^2-VL: Private Photo Dataset Protection by Data Poisoning for Vision-Language Models

Hongyi Miao; Jun Jia; Xincheng Wang; Qianli Ma; Wei Sun; Wangqiu Zhou; Dandan Zhu; Yewen Cao; Zhi Liu; Guangtao Zhai

arXiv:2603.23925·cs.CV·March 26, 2026

DP^2-VL: Private Photo Dataset Protection by Data Poisoning for Vision-Language Models

Hongyi Miao, Jun Jia, Xincheng Wang, Qianli Ma, Wei Sun, Wangqiu Zhou, Dandan Zhu, Yewen Cao, Zhi Liu, Guangtao Zhai

PDF

Open Access

TL;DR

This paper identifies privacy risks in vision-language models where private photos can be exploited to reveal sensitive information, and proposes a data poisoning method called DP2-VL to protect such images effectively.

Contribution

It introduces a new privacy threat model, a benchmark dataset for identity-affiliation leakage, and a novel data poisoning framework, DP2-VL, to safeguard private photos in vision-language models.

Findings

01

VLMs can recognize identities from small private datasets.

02

DP2-VL effectively prevents identity leakage across models.

03

The method is robust to various post-processing and protection ratios.

Abstract

Recent advances in visual-language alignment have endowed vision-language models (VLMs) with fine-grained image understanding capabilities. However, this progress also introduces new privacy risks. This paper first proposes a novel privacy threat model named identity-affiliation learning: an attacker fine-tunes a VLM using only a few private photos of a target individual, thereby embedding associations between the target facial identity and their private property and social relationships into the model's internal representations. Once deployed via public APIs, this model enables unauthorized exposure of the target user's private information upon input of their photos. To benchmark VLMs' susceptibility to such identity-affiliation leakage, we introduce the first identity-affiliation dataset comprising seven typical scenarios appearing in private photos. Each scenario is instantiated with…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdversarial Robustness in Machine Learning · Face recognition and analysis · Multimodal Machine Learning Applications