Universal adversarial perturbations

Seyed-Mohsen Moosavi-Dezfooli; Alhussein Fawzi; Omar Fawzi; Pascal; Frossard

arXiv:1610.08401·cs.CV·March 10, 2017·91 cites

Universal adversarial perturbations

Seyed-Mohsen Moosavi-Dezfooli, Alhussein Fawzi, Omar Fawzi, Pascal, Frossard

PDF

Open Access 5 Repos 1 Video

TL;DR

This paper demonstrates the existence of small, universal perturbations that can fool deep neural networks across many images, revealing vulnerabilities and security concerns in current classifiers.

Contribution

It introduces a systematic algorithm for computing universal adversarial perturbations and shows their high transferability across different neural networks.

Findings

01

Universal perturbations cause misclassification with high probability.

02

Deep neural networks are highly vulnerable to these perturbations.

03

Universal perturbations generalize well across different models.

Abstract

Given a state-of-the-art deep neural network classifier, we show the existence of a universal (image-agnostic) and very small perturbation vector that causes natural images to be misclassified with high probability. We propose a systematic algorithm for computing universal perturbations, and show that state-of-the-art deep neural networks are highly vulnerable to such perturbations, albeit being quasi-imperceptible to the human eye. We further empirically analyze these universal perturbations and show, in particular, that they generalize very well across neural networks. The surprising existence of universal perturbations reveals important geometric correlations among the high-dimensional decision boundary of classifiers. It further outlines potential security breaches with the existence of single directions in the input space that adversaries can possibly exploit to break a classifier…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Videos

Universal Adversarial Perturbations· youtube

Taxonomy

TopicsAdversarial Robustness in Machine Learning · Anomaly Detection Techniques and Applications · Domain Adaptation and Few-Shot Learning