Adversarial Poisoning Attacks and Defense for General Multi-Class Models Based On Synthetic Reduced Nearest Neighbors
Pooya Tavallali, Vahid Behzadan, Peyman Tavallali, Mukesh Singhal

TL;DR
This paper introduces a model-free label-flipping poisoning attack that exploits data multi-modality, and a defense mechanism using Synthetic Reduced Nearest Neighbor to detect and mitigate such attacks, improving model robustness.
Contribution
It presents a novel, scalable attack method and a corresponding defense technique based on SRNN, applicable to general multi-class models and not limited to specific algorithms.
Findings
Attack doubles error rates with the same budget
Defense detects flipped samples during training
Defense outperforms conventional methods in accuracy recovery
Abstract
State-of-the-art machine learning models are vulnerable to data poisoning attacks whose purpose is to undermine the integrity of the model. However, the current literature on data poisoning attacks is mainly focused on ad hoc techniques that are only applicable to specific machine learning models. Additionally, the existing data poisoning attacks in the literature are limited to either binary classifiers or to gradient-based algorithms. To address these limitations, this paper first proposes a novel model-free label-flipping attack based on the multi-modality of the data, in which the adversary targets the clusters of classes while constrained by a label-flipping budget. The complexity of our proposed attack algorithm is linear in time over the size of the dataset. Also, the proposed attack can increase the error up to two times for the same attack budget. Second, a novel defense…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdversarial Robustness in Machine Learning · Anomaly Detection Techniques and Applications · Network Security and Intrusion Detection
MethodsHigh-Order Consensuses
