A Statistical Threshold for Adversarial Classification in Laplace Mechanisms
Ay\c{s}e \"Unsal, Melek \"Onen

TL;DR
This paper develops a statistical framework to detect adversarial attacks on differentially private Laplace mechanisms by analyzing hypothesis testing thresholds, error probabilities, and ROC curves, enhancing understanding of privacy-attack trade-offs.
Contribution
It introduces a novel hypothesis testing approach to characterize the detection threshold and bias for adversarial attacks on Laplace mechanisms, integrating differential privacy with adversarial detection.
Findings
Derived analytical error probabilities for attack detection
Presented ROC curves for various privacy and attack parameters
Provided bounds on the attack bias for effective detection
Abstract
This paper studies the statistical characterization of detecting an adversary who wants to harm some computation such as machine learning models or aggregation by altering the output of a differentially private mechanism in addition to discovering some information about the underlying dataset. An adversary who is able to modify the published information from a differentially private mechanism aims to maximize the possible damage to the system while remaining undetected. We present a trade-off between the privacy parameter of the system, the sensitivity and the attacker's advantage (the bias) through determining the threshold for the best critical region of the hypothesis testing problem for deciding whether or not the adversary's attack is detected. Such trade-offs are provided for Laplace mechanisms using one-sided and two-sided hypothesis tests. Corresponding error probabilities are…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
