VPD-100K: Towards Generalizable and Fine-grained Visual Privacy Protection

Xiaobin Hu; Enpu Zuo; Lanping Hu; Kaiwen Yang; Dianshu Liao; Tianyi Zhang; Bo Yin; Yinsi Zhou; Shidong Pan; Xiaoyu Sun

arXiv:2605.10229·cs.CV·May 12, 2026

VPD-100K: Towards Generalizable and Fine-grained Visual Privacy Protection

Xiaobin Hu, Enpu Zuo, Lanping Hu, Kaiwen Yang, Dianshu Liao, Tianyi Zhang, Bo Yin, Yinsi Zhou, Shidong Pan, Xiaoyu Sun

PDF

1 Repo

TL;DR

This paper introduces VPD-100K, a large-scale, fine-grained dataset for visual privacy detection, along with a novel frequency-enhanced module to improve detection accuracy in real-world scenarios.

Contribution

The paper provides a comprehensive privacy dataset with detailed annotations and proposes a new frequency-domain attention mechanism to enhance privacy detection models.

Findings

01

VPD-100K contains 100,000 images with 33 fine-grained privacy classes.

02

The frequency-enhanced module improves detection performance on image and video benchmarks.

03

Extensive experiments validate the dataset's usefulness and the effectiveness of the proposed method.

Abstract

Privacy protection has become a critical requirement in the era of ubiquitous visual data sharing, imposing higher demands on efficient and robust privacy detection algorithms. However, current robust detection models are severely hindered by the lack of comprehensive datasets. Existing privacy-oriented datasets often suffer from limited scale, coarse-grained annotations, and narrow domain coverage, failing to capture the intricate details of sensitive information in realworld environments. To bridge this gap, we present a large-scale, fine-grained Visual Privacy Dataset (VPD-100K), designed to facilitate generalized privacy detection. We establish a holistic taxonomy comprising four primary domains: Human Presence, On-Screen Personally Identifiable Information (PII), Physical Identifiers, and Location Indicators, containing 100,000 images annotated with 33 fine-grained classes and over…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

https://vpd-100k.github.io
github

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.