Theoretical Guarantees for Model Auditing with Finite Adversaries

Mario Diaz; Peter Kairouz; Jiachun Liao; Lalitha Sankar

arXiv:1911.03405·stat.ML·November 11, 2019·1 cites

Theoretical Guarantees for Model Auditing with Finite Adversaries

Mario Diaz, Peter Kairouz, Jiachun Liao, Lalitha Sankar

PDF

Open Access

TL;DR

This paper provides theoretical insights into the limitations of finite adversaries in detecting privacy violations in models, establishing conditions under which their failure implies the impossibility of success by any stronger adversary.

Contribution

It introduces a theoretical framework that links the capabilities of finite adversaries to the success of model auditing, offering guarantees about privacy violation detection.

Findings

01

Unsuccessful finite adversaries imply no stronger adversary can succeed under certain conditions.

02

The framework quantifies adversary capabilities like network size and side information.

03

Provides theoretical bounds for privacy auditing effectiveness.

Abstract

Privacy concerns have led to the development of privacy-preserving approaches for learning models from sensitive data. Yet, in practice, even models learned with privacy guarantees can inadvertently memorize unique training examples or leak sensitive features. To identify such privacy violations, existing model auditing techniques use finite adversaries defined as machine learning models with (a) access to some finite side information (e.g., a small auditing dataset), and (b) finite capacity (e.g., a fixed neural network architecture). Our work investigates the requirements under which an unsuccessful attempt to identify privacy violations by a finite adversary implies that no stronger adversary can succeed at such a task. We do so via parameters that quantify the capabilities of the finite adversary, including the size of the neural network employed by such an adversary and the amount…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsPrivacy-Preserving Technologies in Data · Adversarial Robustness in Machine Learning · Explainable Artificial Intelligence (XAI)