WaNet -- Imperceptible Warping-based Backdoor Attack

Anh Nguyen; Anh Tran

arXiv:2102.10369·cs.CR·March 5, 2021·128 cites

WaNet -- Imperceptible Warping-based Backdoor Attack

Anh Nguyen, Anh Tran

PDF

Open Access 1 Repo 1 Video

TL;DR

This paper introduces WaNet, a stealthy backdoor attack using imperceptible warping triggers, which outperforms noise-based methods in human tests and bypasses current defenses across multiple datasets.

Contribution

Proposes a novel warping-based backdoor trigger that is more stealthy and effective than noise perturbation triggers, along with a training mode to evade detection.

Findings

01

Outperforms previous noise-based triggers in human inspection tests.

02

Successfully bypasses state-of-the-art defense methods on multiple datasets.

03

Backdoors are transparent to network inspection, demonstrating high stealthiness.

Abstract

With the thriving of deep learning and the widespread practice of using pre-trained networks, backdoor attacks have become an increasing security threat drawing many research interests in recent years. A third-party model can be poisoned in training to work well in normal conditions but behave maliciously when a trigger pattern appears. However, the existing backdoor attacks are all built on noise perturbation triggers, making them noticeable to humans. In this paper, we instead propose using warping-based triggers. The proposed backdoor outperforms the previous methods in a human inspection test by a wide margin, proving its stealthiness. To make such models undetectable by machine defenders, we propose a novel training mode, called the ``noise mode. The trained networks successfully attack and bypass the state-of-the-art defense methods on standard classification datasets, including…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

VinAIResearch/Warping-based_Backdoor_Attack-release
pytorchOfficial

Videos

WaNet - Imperceptible Warping-based Backdoor Attack· slideslive

Taxonomy

TopicsAdversarial Robustness in Machine Learning · Anomaly Detection Techniques and Applications · Network Security and Intrusion Detection