Trade-offs and Guarantees of Adversarial Representation Learning for Information Obfuscation
Han Zhao, Jianfeng Chi, Yuan Tian, Geoffrey J. Gordon

TL;DR
This paper develops a theoretical framework for attribute obfuscation in machine learning, analyzing the trade-off between data privacy and task accuracy, and demonstrates the effectiveness of adversarial learning in balancing these goals.
Contribution
It introduces a minimax optimization framework for attribute obfuscation, providing theoretical guarantees and characterizing the fundamental privacy-accuracy trade-off.
Findings
Adversarial learning achieves the best trade-off between privacy and accuracy.
Theoretical lower bounds characterize the privacy-accuracy trade-off.
Experimental results validate the inference guarantees and trade-off analysis.
Abstract
Crowdsourced data used in machine learning services might carry sensitive information about attributes that users do not want to share. Various methods have been proposed to minimize the potential information leakage of sensitive attributes while maximizing the task accuracy. However, little is known about the theory behind these methods. In light of this gap, we develop a novel theoretical framework for attribute obfuscation. Under our framework, we propose a minimax optimization formulation to protect the given attribute and analyze its inference guarantees against worst-case adversaries. Meanwhile, it is clear that in general there is a tension between minimizing information leakage and maximizing task accuracy. To understand this, we prove an information-theoretic lower bound to precisely characterize the fundamental trade-off between accuracy and information leakage. We conduct…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Taxonomy
TopicsAdversarial Robustness in Machine Learning · Privacy-Preserving Technologies in Data · Mobile Crowdsensing and Crowdsourcing
