Are DNNs fooled by extremely unrecognizable images?

Soichiro Kumano; Hiroshi Kera; Toshihiko Yamasaki

arXiv:2012.03843·cs.CV·March 29, 2022

Are DNNs fooled by extremely unrecognizable images?

Soichiro Kumano, Hiroshi Kera, Toshihiko Yamasaki

PDF

Open Access

TL;DR

This paper investigates whether deep neural networks can be fooled by extremely unrecognizable images, introducing sparse fooling images (SFIs) that lack natural object features and demonstrating their effectiveness in deceiving DNNs, especially in deeper layers.

Contribution

The study introduces SFIs, a minimal class of fooling images with no natural object features, and proves their existence for various models, revealing new vulnerabilities in DNNs.

Findings

01

SFIs can fool DNNs in deeper layers

02

Complex models are more vulnerable to SFI attacks

03

Max pooling layers contribute to vulnerability

Abstract

Fooling images are a potential threat to deep neural networks (DNNs). These images are not recognizable to humans as natural objects, such as dogs and cats, but are misclassified by DNNs as natural-object classes with high confidence scores. Despite their original design concept, existing fooling images retain some features that are characteristic of the target objects if looked into closely. Hence, DNNs can react to these features. In this paper, we address the question of whether there can be fooling images with no characteristic pattern of natural objects locally or globally. As a minimal case, we introduce single-color images with a few pixels altered, called sparse fooling images (SFIs). We first prove that SFIs always exist under mild conditions for linear and nonlinear models and reveal that complex models are more likely to be vulnerable to SFI attacks. With two SFI generation…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdversarial Robustness in Machine Learning · Generative Adversarial Networks and Image Synthesis · Advanced Neural Network Applications