FADER: Fast Adversarial Example Rejection

Francesco Crecchi; Marco Melis; Angelo Sotgiu; Davide Bacciu; Battista; Biggio

arXiv:2010.09119·cs.LG·October 20, 2020

FADER: Fast Adversarial Example Rejection

Francesco Crecchi, Marco Melis, Angelo Sotgiu, Davide Bacciu, Battista, Biggio

PDF

TL;DR

FADER introduces a fast, prototype-efficient adversarial example detection method using RBF networks, significantly reducing runtime complexity while maintaining high accuracy against adversarial attacks.

Contribution

It unifies existing detection methods into a common framework and proposes FADER, a novel RBF-based detector that improves efficiency without losing detection performance.

Findings

01

Up to 73x prototype reduction on MNIST

02

Up to 50x prototype reduction on CIFAR10

03

Maintains classification accuracy on clean and adversarial data

Abstract

Deep neural networks are vulnerable to adversarial examples, i.e., carefully-crafted inputs that mislead classification at test time. Recent defenses have been shown to improve adversarial robustness by detecting anomalous deviations from legitimate training samples at different layer representations - a behavior normally exhibited by adversarial attacks. Despite technical differences, all aforementioned methods share a common backbone structure that we formalize and highlight in this contribution, as it can help in identifying promising research directions and drawbacks of existing methods. The first main contribution of this work is the review of these detection methods in the form of a unifying framework designed to accommodate both existing defenses and newer ones to come. In terms of drawbacks, the overmentioned defenses require comparing input samples against an oversized number…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.