A Synergetic Attack against Neural Network Classifiers combining   Backdoor and Adversarial Examples

Guanxiong Liu; Issa Khalil; Abdallah Khreishah; NhatHai Phan

arXiv:2109.01275·cs.CR·September 6, 2021

A Synergetic Attack against Neural Network Classifiers combining Backdoor and Adversarial Examples

Guanxiong Liu, Issa Khalil, Abdallah Khreishah, NhatHai Phan

PDF

Open Access

TL;DR

This paper introduces AdvTrojan, a novel stealthy attack that combines adversarial perturbations and Trojan backdoors to bypass defenses and compromise neural network classifiers.

Contribution

The work presents a new attack method, AdvTrojan, that jointly exploits adversarial and backdoor vulnerabilities, demonstrating high success rates against existing defenses.

Findings

01

AdvTrojan achieves nearly 100% success rate in experiments.

02

It can bypass current defenses effectively.

03

The attack extends to federated learning scenarios.

Abstract

In this work, we show how to jointly exploit adversarial perturbation and model poisoning vulnerabilities to practically launch a new stealthy attack, dubbed AdvTrojan. AdvTrojan is stealthy because it can be activated only when: 1) a carefully crafted adversarial perturbation is injected into the input examples during inference, and 2) a Trojan backdoor is implanted during the training process of the model. We leverage adversarial noise in the input space to move Trojan-infected examples across the model decision boundary, making it difficult to detect. The stealthiness behavior of AdvTrojan fools the users into accidentally trust the infected model as a robust classifier against adversarial examples. AdvTrojan can be implemented by only poisoning the training data similar to conventional Trojan backdoor attacks. Our thorough analysis and extensive experiments on several benchmark…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdversarial Robustness in Machine Learning