Why Blocking Targeted Adversarial Perturbations Impairs the Ability to   Learn

Ziv Katzir; Yuval Elovici

arXiv:1907.05718·cs.LG·July 15, 2019·1 cites

Why Blocking Targeted Adversarial Perturbations Impairs the Ability to Learn

Ziv Katzir, Yuval Elovici

PDF

Open Access

TL;DR

This paper investigates the limitations of defensive distillation in neural networks, revealing it effectively defends against non-targeted attacks but fails against targeted ones, which compromise the network's learning ability.

Contribution

It systematically analyzes defensive distillation, demonstrating its vulnerability to targeted attacks and highlighting the fundamental tradeoff between robustness and learnability.

Findings

01

Defensive distillation is effective against non-targeted attacks.

02

Targeted attacks can bypass defensive distillation using simple methods.

03

Blocking targeted attacks impairs the network's learning capacity.

Abstract

Despite their accuracy, neural network-based classifiers are still prone to manipulation through adversarial perturbations. Those perturbations are designed to be misclassified by the neural network, while being perceptually identical to some valid input. The vast majority of attack methods rely on white-box conditions, where the attacker has full knowledge of the attacked network's parameters. This allows the attacker to calculate the network's loss gradient with respect to some valid input and use this gradient in order to create an adversarial example. The task of blocking white-box attacks has proven difficult to solve. While a large number of defense methods have been suggested, they have had limited success. In this work we examine this difficulty and try to understand it. We systematically explore the abilities and limitations of defensive distillation, one of the most promising…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdversarial Robustness in Machine Learning · Bacillus and Francisella bacterial research · Advanced Malware Detection Techniques