Contrastive Privacy: A Semantic Approach to Measuring Privacy of AI-based Sanitization
George Bissias, Eugene Bagdasarian, Brian Neil Levine

TL;DR
This paper introduces contrastive privacy, a formal, model-agnostic, and semantic-based method to quantitatively evaluate the effectiveness of AI-driven sanitization techniques on images and text.
Contribution
It proposes a novel contrastive privacy definition that provides formal guarantees and can operate across multiple media modalities without manual labeling.
Findings
Quantifies sanitization success across 34 image model combinations.
Assesses 15 text models for privacy effectiveness.
Identifies specific failures in sanitization processes.
Abstract
To sanitize specific concepts from imagery and text, privacy mechanisms with formal guarantees are often eschewed in practice in favor of more intuitive techniques. AI-based sanitization is poised to grow in popularity because it can work with the semantics of natural language concepts; e.g., a prompt to "remove faces, clothing, and body shape". Many approaches exist commercially and as prior work. But, the evaluation of such approaches has been bespoke and without formal guarantees. To fill this gap, we propose contrastive privacy, a formal definition of privacy that provides a systematic and quantitative test of sanitized media that has a semantic interpretation. It is independent of the model and mechanism used and operates across multiple media modalities. Contrastive privacy provides guarantees under ideal conditions; and we show how to operationalize the definition with…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
