Little Data, Big Impact: Privacy-Aware Visual Language Models via Minimal Tuning

Laurens Samson; Nimrod Barazani; Sennay Ghebreab; Yuki M. Asano

arXiv:2405.17423·cs.CV·May 27, 2025·2 cites

Little Data, Big Impact: Privacy-Aware Visual Language Models via Minimal Tuning

Laurens Samson, Nimrod Barazani, Sennay Ghebreab, Yuki M. Asano

PDF

Open Access

TL;DR

This paper demonstrates that privacy-aware visual language models can be effectively developed using minimal data and carefully designed benchmarks, significantly improving privacy sensitivity without sacrificing overall performance.

Contribution

The authors introduce high-quality privacy benchmarks and a minimal instruction-tuning dataset, enabling substantial privacy-awareness improvements with only 100 samples.

Findings

01

Privacy benchmarks PrivBench and PrivBench-H are effective for evaluation.

02

Fine-tuning with 100 samples improves privacy sensitivity significantly.

03

Privacy-aware models outperform GPT-4 on privacy benchmarks.

Abstract

As Visual Language Models (VLMs) become increasingly embedded in everyday applications, ensuring they can recognize and appropriately handle privacy-sensitive content is essential. We conduct a comprehensive evaluation of ten state-of-the-art VLMs and identify limitations in their understanding of visual privacy. Existing datasets suffer from label inconsistencies, limiting their reliability. To address this, we introduce two compact, high-quality benchmarks, PrivBench and PrivBench-H, that focus on commonly recognized privacy categories aligned with the General Data Protection Regulation (GDPR). Additionally, we present PrivTune, an instruction-tuning dataset specifically curated to improve privacy sensitivity. We obtain a Privacy VLM by fine-tuning an off-the-shelf VLM on only 100 samples from PrivTune, which leads to substantial gains on all benchmarks, surpassing GPT-4, while…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsDigital and Cyber Forensics