Data Poisoning against Differentially-Private Learners: Attacks and   Defenses

Yuzhe Ma; Xiaojin Zhu; Justin Hsu

arXiv:1903.09860·cs.LG·July 8, 2019·27 cites

Data Poisoning against Differentially-Private Learners: Attacks and Defenses

Yuzhe Ma, Xiaojin Zhu, Justin Hsu

PDF

Open Access

TL;DR

This paper investigates the effectiveness of differential privacy as a defense against data poisoning attacks in machine learning, showing it provides resistance with limited attack scope but degrades with increased poisoning.

Contribution

It introduces attack algorithms targeting differentially-private learners and evaluates their effectiveness, highlighting the limits of privacy-based defenses against extensive data poisoning.

Findings

01

Differential privacy offers resistance to small-scale poisoning attacks.

02

Effectiveness of defenses diminishes as the attacker poisons more data.

03

Attack algorithms successfully compromise privacy-preserving learners with sufficient data poisoning.

Abstract

Data poisoning attacks aim to manipulate the model produced by a learning algorithm by adversarially modifying the training set. We consider differential privacy as a defensive measure against this type of attack. We show that such learners are resistant to data poisoning attacks when the adversary is only able to poison a small number of items. However, this protection degrades as the adversary poisons more data. To illustrate, we design attack algorithms targeting objective and output perturbation learners, two standard approaches to differentially-private machine learning. Experiments show that our methods are effective when the attacker is allowed to poison sufficiently many training items.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdversarial Robustness in Machine Learning · Privacy-Preserving Technologies in Data · Cryptography and Data Security