Backdoor Smoothing: Demystifying Backdoor Attacks on Deep Neural   Networks

Kathrin Grosse; Taesung Lee; Battista Biggio; Youngja Park; Michael; Backes; Ian Molloy

arXiv:2006.06721·cs.LG·November 3, 2021

Backdoor Smoothing: Demystifying Backdoor Attacks on Deep Neural Networks

Kathrin Grosse, Taesung Lee, Battista Biggio, Youngja Park, Michael, Backes, Ian Molloy

PDF

Open Access

TL;DR

This paper investigates how backdoor attacks create smoother decision boundaries in neural networks around triggered inputs, revealing a phenomenon called backdoor smoothing that affects attack success and defense strategies.

Contribution

It introduces the concept of backdoor smoothing, quantifies it with a new measure, and shows its relation to attack success and potential for detecting artificial patterns.

Findings

01

Backdoor attacks induce increased smoothness around triggered samples.

02

Smoother decision functions correlate with more successful attacks.

03

Other artificial patterns can also cause smoothing, affecting defense methods.

Abstract

Backdoor attacks mislead machine-learning models to output an attacker-specified class when presented a specific trigger at test time. These attacks require poisoning the training data to compromise the learning algorithm, e.g., by injecting poisoning samples containing the trigger into the training set, along with the desired class label. Despite the increasing number of studies on backdoor attacks and defenses, the underlying factors affecting the success of backdoor attacks, along with their impact on the learning algorithm, are not yet well understood. In this work, we aim to shed light on this issue by unveiling that backdoor attacks induce a smoother decision function around the triggered samples -- a phenomenon which we refer to as \textit{backdoor smoothing}. To quantify backdoor smoothing, we define a measure that evaluates the uncertainty associated to the predictions of a…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdversarial Robustness in Machine Learning · Advanced Malware Detection Techniques · Anomaly Detection Techniques and Applications