Semantic Adversarial Examples

Hossein Hosseini; Radha Poovendran

arXiv:1804.00499·cs.CV·April 3, 2018

Semantic Adversarial Examples

Hossein Hosseini, Radha Poovendran

PDF

1 Repo

TL;DR

This paper introduces 'Semantic Adversarial Examples' that are visually similar to original images but fool neural networks, using color-shifting transformations based on human shape bias, revealing new vulnerabilities.

Contribution

The paper proposes a novel class of adversarial examples that preserve semantic content while fooling models, using a shape bias-inspired color-shifting method.

Findings

01

VGG16 accuracy drops to 5.7% on adversarial color-shifted images.

02

Semantic adversarial examples are effective despite preserving the original object semantics.

03

The method leverages human shape bias to generate robust adversarial transformations.

Abstract

Deep neural networks are known to be vulnerable to adversarial examples, i.e., images that are maliciously perturbed to fool the model. Generating adversarial examples has been mostly limited to finding small perturbations that maximize the model prediction error. Such images, however, contain artificial perturbations that make them somewhat distinguishable from natural images. This property is used by several defense methods to counter adversarial examples by applying denoising filters or training the model to be robust to small perturbations. In this paper, we introduce a new class of adversarial examples, namely "Semantic Adversarial Examples," as images that are arbitrarily perturbed to fool the model, but in such a way that the modified image semantically represents the same object as the original image. We formulate the problem of generating such images as a constrained…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

HosseinHosseini/Semantic-Adversarial-Examples
none

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.