The Adversarial Attack and Detection under the Fisher Information Metric

Chenxiao Zhao; P. Thomas Fletcher; Mixue Yu; Yaxin Peng; Guixu Zhang; and Chaomin Shen

arXiv:1810.03806·cs.LG·February 12, 2019·5 cites

The Adversarial Attack and Detection under the Fisher Information Metric

Chenxiao Zhao, P. Thomas Fletcher, Mixue Yu, Yaxin Peng, Guixu Zhang, and Chaomin Shen

PDF

Open Access 1 Repo

TL;DR

This paper introduces a novel adversarial attack and detection method based on the Fisher information metric, providing a geometric understanding of model vulnerability and improving robustness analysis in deep learning.

Contribution

It proposes the one-step spectral attack and eigenvalue-based detection methods using Fisher information geometry, offering new insights into adversarial vulnerability and defense strategies.

Findings

01

The spectral attack effectively identifies vulnerable directions in data space.

02

Eigenvalues correlate with model susceptibility to attacks.

03

The methods outperform existing approaches in efficiency and accuracy.

Abstract

Many deep learning models are vulnerable to the adversarial attack, i.e., imperceptible but intentionally-designed perturbations to the input can cause incorrect output of the networks. In this paper, using information geometry, we provide a reasonable explanation for the vulnerability of deep learning models. By considering the data space as a non-linear space with the Fisher information metric induced from a neural network, we first propose an adversarial attack algorithm termed one-step spectral attack (OSSA). The method is described by a constrained quadratic form of the Fisher information matrix, where the optimal adversarial perturbation is given by the first eigenvector, and the model vulnerability is reflected by the eigenvalues. The larger an eigenvalue is, the more vulnerable the model is to be attacked by the corresponding eigenvector. Taking advantage of the property, we…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

eliot-tron/curvnetattack
pytorch

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdversarial Robustness in Machine Learning · Bacillus and Francisella bacterial research · Anomaly Detection Techniques and Applications