Theoretical Guarantees for Model Auditing with Finite Adversaries
Mario Diaz, Peter Kairouz, Jiachun Liao, Lalitha Sankar

TL;DR
This paper provides theoretical insights into the limitations of finite adversaries in detecting privacy violations in models, establishing conditions under which their failure implies the impossibility of success by any stronger adversary.
Contribution
It introduces a theoretical framework that links the capabilities of finite adversaries to the success of model auditing, offering guarantees about privacy violation detection.
Findings
Unsuccessful finite adversaries imply no stronger adversary can succeed under certain conditions.
The framework quantifies adversary capabilities like network size and side information.
Provides theoretical bounds for privacy auditing effectiveness.
Abstract
Privacy concerns have led to the development of privacy-preserving approaches for learning models from sensitive data. Yet, in practice, even models learned with privacy guarantees can inadvertently memorize unique training examples or leak sensitive features. To identify such privacy violations, existing model auditing techniques use finite adversaries defined as machine learning models with (a) access to some finite side information (e.g., a small auditing dataset), and (b) finite capacity (e.g., a fixed neural network architecture). Our work investigates the requirements under which an unsuccessful attempt to identify privacy violations by a finite adversary implies that no stronger adversary can succeed at such a task. We do so via parameters that quantify the capabilities of the finite adversary, including the size of the neural network employed by such an adversary and the amount…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsPrivacy-Preserving Technologies in Data · Adversarial Robustness in Machine Learning · Explainable Artificial Intelligence (XAI)
