On the Effectiveness of Mitigating Data Poisoning Attacks with Gradient Shaping
Sanghyun Hong, Varun Chandrasekaran, Yi\u{g}itcan Kaya, Tudor, Dumitra\c{s}, Nicolas Papernot

TL;DR
This paper investigates a generic defense mechanism against data poisoning attacks in machine learning by shaping gradients to reduce their susceptibility, using differential privacy techniques to improve robustness.
Contribution
It introduces the concept of gradient shaping as a generic defense against poisoning attacks and evaluates its effectiveness using DP-SGD, showing increased robustness.
Findings
DP-SGD increases robustness to indiscriminate attacks
Gradient artifacts include higher magnitude and different orientation
Gradient shaping offers a promising direction for defense
Abstract
Machine learning algorithms are vulnerable to data poisoning attacks. Prior taxonomies that focus on specific scenarios, e.g., indiscriminate or targeted, have enabled defenses for the corresponding subset of known attacks. Yet, this introduces an inevitable arms race between adversaries and defenders. In this work, we study the feasibility of an attack-agnostic defense relying on artifacts that are common to all poisoning attacks. Specifically, we focus on a common element between all attacks: they modify gradients computed to train the model. We identify two main artifacts of gradients computed in the presence of poison: (1) their norms have significantly higher magnitudes than those of clean gradients, and (2) their orientation differs from clean gradients. Based on these observations, we propose the prerequisite for a generic poisoning defense: it must bound gradient…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdversarial Robustness in Machine Learning · Privacy-Preserving Technologies in Data · Cryptography and Data Security
