TL;DR
This paper introduces VPD-100K, a large-scale, fine-grained dataset for visual privacy detection, along with a novel frequency-enhanced module to improve detection accuracy in real-world scenarios.
Contribution
The paper provides a comprehensive privacy dataset with detailed annotations and proposes a new frequency-domain attention mechanism to enhance privacy detection models.
Findings
VPD-100K contains 100,000 images with 33 fine-grained privacy classes.
The frequency-enhanced module improves detection performance on image and video benchmarks.
Extensive experiments validate the dataset's usefulness and the effectiveness of the proposed method.
Abstract
Privacy protection has become a critical requirement in the era of ubiquitous visual data sharing, imposing higher demands on efficient and robust privacy detection algorithms. However, current robust detection models are severely hindered by the lack of comprehensive datasets. Existing privacy-oriented datasets often suffer from limited scale, coarse-grained annotations, and narrow domain coverage, failing to capture the intricate details of sensitive information in realworld environments. To bridge this gap, we present a large-scale, fine-grained Visual Privacy Dataset (VPD-100K), designed to facilitate generalized privacy detection. We establish a holistic taxonomy comprising four primary domains: Human Presence, On-Screen Personally Identifiable Information (PII), Physical Identifiers, and Location Indicators, containing 100,000 images annotated with 33 fine-grained classes and over…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
