Classification Auto-Encoder based Detector against Diverse Data   Poisoning Attacks

Fereshteh Razmi; Li Xiong

arXiv:2108.04206·cs.LG·October 28, 2024·1 cites

Classification Auto-Encoder based Detector against Diverse Data Poisoning Attacks

Fereshteh Razmi, Li Xiong

PDF

Open Access 1 Repo

TL;DR

This paper introduces CAE, a Classification Auto-Encoder based method that detects various data poisoning attacks without needing a clean dataset, improving robustness and accuracy in machine learning models.

Contribution

The paper presents a novel CAE-based detection approach capable of identifying diverse poisoning attacks without prior knowledge or clean data, including an enhanced version CAE+ that does not require clean training data.

Findings

01

Effective detection of multiple poisoning attack types.

02

Maintains detection performance with up to 30% contaminated data.

03

Helps classifiers recover accuracy under attack conditions.

Abstract

Poisoning attacks are a category of adversarial machine learning threats in which an adversary attempts to subvert the outcome of the machine learning systems by injecting crafted data into training data set, thus increasing the machine learning model's test error. The adversary can tamper with the data feature space, data labels, or both, each leading to a different attack strategy with different strengths. Various detection approaches have recently emerged, each focusing on one attack strategy. The Achilles heel of many of these detection approaches is their dependence on having access to a clean, untampered data set. In this paper, we propose CAE, a Classification Auto-Encoder based detector against diverse poisoned data. CAE can detect all forms of poisoning attacks using a combination of reconstruction and classification errors without having any prior knowledge of the attack…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

emory-aims/cae
tfOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdversarial Robustness in Machine Learning · Anomaly Detection Techniques and Applications · Advanced Malware Detection Techniques

MethodsSupport Vector Machine