A Statistical Threshold for Adversarial Classification in Laplace   Mechanisms

Ay\c{s}e \"Unsal; Melek \"Onen

arXiv:2105.05610·cs.IT·June 28, 2021

A Statistical Threshold for Adversarial Classification in Laplace Mechanisms

Ay\c{s}e \"Unsal, Melek \"Onen

PDF

TL;DR

This paper develops a statistical framework to detect adversarial attacks on differentially private Laplace mechanisms by analyzing hypothesis testing thresholds, error probabilities, and ROC curves, enhancing understanding of privacy-attack trade-offs.

Contribution

It introduces a novel hypothesis testing approach to characterize the detection threshold and bias for adversarial attacks on Laplace mechanisms, integrating differential privacy with adversarial detection.

Findings

01

Derived analytical error probabilities for attack detection

02

Presented ROC curves for various privacy and attack parameters

03

Provided bounds on the attack bias for effective detection

Abstract

This paper studies the statistical characterization of detecting an adversary who wants to harm some computation such as machine learning models or aggregation by altering the output of a differentially private mechanism in addition to discovering some information about the underlying dataset. An adversary who is able to modify the published information from a differentially private mechanism aims to maximize the possible damage to the system while remaining undetected. We present a trade-off between the privacy parameter of the system, the sensitivity and the attacker's advantage (the bias) through determining the threshold for the best critical region of the hypothesis testing problem for deciding whether or not the adversary's attack is detected. Such trade-offs are provided for Laplace mechanisms using one-sided and two-sided hypothesis tests. Corresponding error probabilities are…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.