Friendly Noise against Adversarial Noise: A Powerful Defense against Data Poisoning Attacks
Tian Yu Liu, Yu Yang, Baharan Mirzasoleiman

TL;DR
This paper introduces a simple, effective defense against invisible data poisoning attacks that minimally impacts model performance by adding optimized friendly noise and random noise to training data, thwarting various attack types.
Contribution
The authors propose a novel defense method combining optimized friendly noise and random noise to counteract diverse invisible poisoning attacks with minimal performance loss.
Findings
Effective against triggerless targeted attacks
Maintains high generalization performance
Transferable across architectures
Abstract
A powerful category of (invisible) data poisoning attacks modify a subset of training examples by small adversarial perturbations to change the prediction of certain test-time data. Existing defense mechanisms are not desirable to deploy in practice, as they often either drastically harm the generalization performance, or are attack-specific, and prohibitively slow to apply. Here, we propose a simple but highly effective approach that unlike existing methods breaks various types of invisible poisoning attacks with the slightest drop in the generalization performance. We make the key observation that attacks introduce local sharp regions of high training loss, which when minimized, results in learning the adversarial perturbations and makes the attack successful. To break poisoning attacks, our key idea is to alleviate the sharp loss regions introduced by poisons. To do so, our approach…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdversarial Robustness in Machine Learning · Anomaly Detection Techniques and Applications · Advanced Neural Network Applications
