Just How Toxic is Data Poisoning? A Unified Benchmark for Backdoor and   Data Poisoning Attacks

Avi Schwarzschild; Micah Goldblum; Arjun Gupta; John P Dickerson; Tom; Goldstein

arXiv:2006.12557·cs.LG·June 18, 2021·56 cites

Just How Toxic is Data Poisoning? A Unified Benchmark for Backdoor and Data Poisoning Attacks

Avi Schwarzschild, Micah Goldblum, Arjun Gupta, John P Dickerson, Tom, Goldstein

PDF

Open Access 2 Repos 1 Video

TL;DR

This paper introduces a standardized benchmark for evaluating data poisoning and backdoor attacks, revealing their varying effectiveness across different testing scenarios and emphasizing the need for realistic evaluation methods.

Contribution

It provides a unified benchmark framework for fair comparison of data poisoning and backdoor attack methods in realistic settings.

Findings

01

Existing methods may not generalize well to real-world scenarios

02

Data poisoning and backdoor attacks are highly sensitive to testing conditions

03

Standardized benchmarks enable fairer evaluation of attack effectiveness

Abstract

Data poisoning and backdoor attacks manipulate training data in order to cause models to fail during inference. A recent survey of industry practitioners found that data poisoning is the number one concern among threats ranging from model stealing to adversarial attacks. However, it remains unclear exactly how dangerous poisoning methods are and which ones are more effective considering that these methods, even ones with identical objectives, have not been tested in consistent or realistic settings. We observe that data poisoning and backdoor attacks are highly sensitive to variations in the testing setup. Moreover, we find that existing methods may not generalize to realistic settings. While these existing works serve as valuable prototypes for data poisoning, we apply rigorous tests to determine the extent to which we should fear them. In order to promote fair comparison in future…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Videos

Just How Toxic is Data Poisoning? A Unified Benchmark for Backdoor and Data Poisoning Attacks· slideslive

Taxonomy

TopicsAdversarial Robustness in Machine Learning · Anomaly Detection Techniques and Applications