One pixel attack for fooling deep neural networks

Jiawei Su; Danilo Vasconcellos Vargas; Sakurai Kouichi

arXiv:1710.08864·cs.LG·October 18, 2019

One pixel attack for fooling deep neural networks

Jiawei Su, Danilo Vasconcellos Vargas, Sakurai Kouichi

PDF

5 Repos 1 Video

TL;DR

This paper introduces a novel one-pixel attack method using differential evolution to fool deep neural networks, revealing their vulnerability even with minimal input modifications.

Contribution

The paper presents a black-box one-pixel attack leveraging differential evolution, demonstrating its effectiveness across multiple datasets and highlighting a new low-dimensional adversarial attack approach.

Findings

01

67.97% of CIFAR-10 images can be fooled with one pixel change

02

16.04% of ImageNet images can be fooled with one pixel change

03

The attack requires minimal adversarial information and is highly effective

Abstract

Recent research has revealed that the output of Deep Neural Networks (DNN) can be easily altered by adding relatively small perturbations to the input vector. In this paper, we analyze an attack in an extremely limited scenario where only one pixel can be modified. For that we propose a novel method for generating one-pixel adversarial perturbations based on differential evolution (DE). It requires less adversarial information (a black-box attack) and can fool more types of networks due to the inherent features of DE. The results show that 67.97% of the natural images in Kaggle CIFAR-10 test dataset and 16.04% of the ImageNet (ILSVRC 2012) test images can be perturbed to at least one target class by modifying just one pixel with 74.03% and 22.91% confidence on average. We also show the same vulnerability on the original CIFAR-10 dataset. Thus, the proposed attack explores a different…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Videos

One Pixel Attack Defeats Neural Networks | Two Minute Papers #240· youtube