Adversarial Poisoning Attacks and Defense for General Multi-Class Models   Based On Synthetic Reduced Nearest Neighbors

Pooya Tavallali; Vahid Behzadan; Peyman Tavallali; Mukesh Singhal

arXiv:2102.05867·cs.LG·February 12, 2021·1 cites

Adversarial Poisoning Attacks and Defense for General Multi-Class Models Based On Synthetic Reduced Nearest Neighbors

Pooya Tavallali, Vahid Behzadan, Peyman Tavallali, Mukesh Singhal

PDF

Open Access

TL;DR

This paper introduces a model-free label-flipping poisoning attack that exploits data multi-modality, and a defense mechanism using Synthetic Reduced Nearest Neighbor to detect and mitigate such attacks, improving model robustness.

Contribution

It presents a novel, scalable attack method and a corresponding defense technique based on SRNN, applicable to general multi-class models and not limited to specific algorithms.

Findings

01

Attack doubles error rates with the same budget

02

Defense detects flipped samples during training

03

Defense outperforms conventional methods in accuracy recovery

Abstract

State-of-the-art machine learning models are vulnerable to data poisoning attacks whose purpose is to undermine the integrity of the model. However, the current literature on data poisoning attacks is mainly focused on ad hoc techniques that are only applicable to specific machine learning models. Additionally, the existing data poisoning attacks in the literature are limited to either binary classifiers or to gradient-based algorithms. To address these limitations, this paper first proposes a novel model-free label-flipping attack based on the multi-modality of the data, in which the adversary targets the clusters of classes while constrained by a label-flipping budget. The complexity of our proposed attack algorithm is linear in time over the size of the dataset. Also, the proposed attack can increase the error up to two times for the same attack budget. Second, a novel defense…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdversarial Robustness in Machine Learning · Anomaly Detection Techniques and Applications · Network Security and Intrusion Detection

MethodsHigh-Order Consensuses