Detection of Iterative Adversarial Attacks via Counter Attack

Matthias Rottmann; Kira Maag; Mathis Peyron; Natasa Krejic; Hanno; Gottschalk

arXiv:2009.11397·cs.LG·March 24, 2021·1 cites

Detection of Iterative Adversarial Attacks via Counter Attack

Matthias Rottmann, Kira Maag, Mathis Peyron, Natasa Krejic, Hanno, Gottschalk

PDF

Open Access

TL;DR

This paper demonstrates that the Carlini-Wagner iterative attack can be used as an effective detector for adversarial examples, achieving near-optimal separation with high AUROC scores on CIFAR10 and ImageNet.

Contribution

It provides a mathematical proof that CW attacks can serve as detectors and validates this with numerical experiments showing high detection accuracy.

Findings

01

AUROC up to 99.73% on CIFAR10

02

AUROC up to 99.73% on ImageNet

03

CW attack-based detector outperforms many existing methods

Abstract

Deep neural networks (DNNs) have proven to be powerful tools for processing unstructured data. However for high-dimensional data, like images, they are inherently vulnerable to adversarial attacks. Small almost invisible perturbations added to the input can be used to fool DNNs. Various attacks, hardening methods and detection methods have been introduced in recent years. Notoriously, Carlini-Wagner (CW) type attacks computed by iterative minimization belong to those that are most difficult to detect. In this work we outline a mathematical proof that the CW attack can be used as a detector itself. That is, under certain assumptions and in the limit of attack iterations this detector provides asymptotically optimal separation of original and attacked images. In numerical experiments, we experimentally validate this statement and furthermore obtain AUROC values up to 99.73% on CIFAR10 and…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdversarial Robustness in Machine Learning · Anomaly Detection Techniques and Applications · COVID-19 diagnosis using AI