UPSET and ANGRI : Breaking High Performance Image Classifiers

Sayantan Sarkar; Ankan Bansal; Upal Mahbub; Rama Chellappa

arXiv:1707.01159 · cs.CV · July 6, 2017 · 90 citations

TL;DR

This paper introduces two targeted attack methods for fooling high-performance image classifiers: UPSET, which generates universal perturbations for target classes, and ANGRI, which generates image-specific perturbations. Both are evaluated on the MNIST and CIFAR10 datasets.

Contribution

The paper presents two new targeted attack algorithms, UPSET and ANGRI, that effectively fool image classifiers with universal and image-specific perturbations.

Findings

01. Both methods successfully fool classifiers on MNIST and CIFAR10.

02. Universal perturbations can target specific classes effectively.

03. Image-specific perturbations achieve high fooling rates.

Abstract

In this paper, targeted fooling of high-performance image classifiers is achieved by developing two novel attack methods. The first method generates universal perturbations for target classes, and the second generates image-specific perturbations. Extensive experiments are conducted on the MNIST and CIFAR10 datasets to provide insights into the proposed algorithms and to show their effectiveness.
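To make the distinction between the two kinds of perturbation concrete, the following is a minimal sketch rather than the authors' implementation. It assumes flattened images with pixel values in [-1, 1], an additive perturbation followed by clipping, and hypothetical names (`apply_universal`, `apply_image_specific`, `residuals`, `generator`, `scale`) that do not come from the abstract. A universal perturbation is looked up by target class alone, while an image-specific perturbation is computed from the input image as well.

```python
from typing import Callable, Dict, List

Image = List[float]  # flattened image; pixel range [-1, 1] is an assumption


def clip(value: float, low: float = -1.0, high: float = 1.0) -> float:
    """Keep a pixel value inside the assumed valid range."""
    return max(low, min(high, value))


def apply_universal(image: Image, target: int,
                    residuals: Dict[int, Image], scale: float = 1.0) -> Image:
    """Add the one perturbation stored for `target` to any input image."""
    residual = residuals[target]  # depends on the target class only
    return [clip(p + scale * r) for p, r in zip(image, residual)]


def apply_image_specific(image: Image, target: int,
                         generator: Callable[[Image, int], Image]) -> Image:
    """Add a perturbation computed from both the image and the target class."""
    residual = generator(image, target)  # depends on the image as well
    return [clip(p + r) for p, r in zip(image, residual)]


# Toy data: a 4-pixel "image" and a hand-written perturbation for class 3.
image = [0.75, -0.75, 0.0, 1.0]
residuals = {3: [0.5, -0.5, 0.25, 0.0]}
universal_result = apply_universal(image, 3, residuals)

# Toy stand-in for a learned generator: dampen every pixel by half.
def toy_generator(img: Image, target: int) -> Image:
    return [-0.5 * p for p in img]

specific_result = apply_image_specific(image, 3, toy_generator)
```

In a real attack the stored residuals and the generator would be learned so that the classifier outputs the target class; the hand-written values here only illustrate the data flow and the clipping step.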

Taxonomy

Topics: Adversarial Robustness in Machine Learning · Advanced Neural Network Applications · Domain Adaptation and Few-Shot Learning