Obfuscation via Information Density Estimation

Hsiang Hsu; Shahab Asoodeh; Flavio du Pin Calmon

arXiv:1910.08109·cs.IT·October 21, 2019·6 cites

Obfuscation via Information Density Estimation

Hsiang Hsu, Shahab Asoodeh, Flavio du Pin Calmon

PDF

Open Access

TL;DR

This paper introduces a data-driven framework for identifying and obfuscating features that leak sensitive information using a novel information density estimator, with proven leakage guarantees and practical implementation on real datasets.

Contribution

We propose a new framework utilizing information density estimation to identify and obfuscate leaking features, including a novel estimator called TIDE with theoretical guarantees.

Findings

01

Effective identification of information-leaking features.

02

Successful implementation of obfuscation with leakage guarantees.

03

Validation on three real-world datasets.

Abstract

Identifying features that leak information about sensitive attributes is a key challenge in the design of information obfuscation mechanisms. In this paper, we propose a framework to identify information-leaking features via information density estimation. Here, features whose information densities exceed a pre-defined threshold are deemed information-leaking features. Once these features are identified, we sequentially pass them through a targeted obfuscation mechanism with a provable leakage guarantee in terms of $E_{γ}$ -divergence. The core of this mechanism relies on a data-driven estimate of the trimmed information density for which we propose a novel estimator, named the trimmed information density estimator (TIDE). We then use TIDE to implement our mechanism on three real-world datasets. Our approach can be used as a data-driven pipeline for designing obfuscation…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdversarial Robustness in Machine Learning · Privacy-Preserving Technologies in Data · Anomaly Detection Techniques and Applications