Verifying the Causes of Adversarial Examples

Honglin Li; Yifei Fan; Frieder Ganz; Anthony Yezzi; Payam Barnaghi

arXiv:2010.09633·cs.LG·October 20, 2020

Verifying the Causes of Adversarial Examples

Honglin Li, Yifei Fan, Frieder Ganz, Anthony Yezzi, Payam Barnaghi

PDF

Open Access

TL;DR

This paper investigates the underlying causes of adversarial examples in neural networks through controlled experiments, identifying geometry and statistical factors as key contributors to their formation.

Contribution

It systematically verifies potential causes of adversarial examples and introduces new techniques to control these effects, enhancing understanding of their origins.

Findings

01

Geometric factors are primary causes of adversarial examples.

02

Statistical factors amplify adversarial phenomena, especially at high confidence levels.

03

Controlled experiments validate the influence of model linearity and category geometry.

Abstract

The robustness of neural networks is challenged by adversarial examples that contain almost imperceptible perturbations to inputs, which mislead a classifier to incorrect outputs in high confidence. Limited by the extreme difficulty in examining a high-dimensional image space thoroughly, research on explaining and justifying the causes of adversarial examples falls behind studies on attacks and defenses. In this paper, we present a collection of potential causes of adversarial examples and verify (or partially verify) them through carefully-designed controlled experiments. The major causes of adversarial examples include model linearity, one-sum constraint, and geometry of the categories. To control the effect of those causes, multiple techniques are applied such as $L_{2}$ normalization, replacement of loss functions, construction of reference datasets, and novel models using multi-layer…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdversarial Robustness in Machine Learning · Anomaly Detection Techniques and Applications · Advanced Neural Network Applications