FADER: Fast Adversarial Example Rejection
Francesco Crecchi, Marco Melis, Angelo Sotgiu, Davide Bacciu, Battista, Biggio

TL;DR
FADER introduces a fast, prototype-efficient adversarial example detection method using RBF networks, significantly reducing runtime complexity while maintaining high accuracy against adversarial attacks.
Contribution
It unifies existing detection methods into a common framework and proposes FADER, a novel RBF-based detector that improves efficiency without losing detection performance.
Findings
Up to 73x prototype reduction on MNIST
Up to 50x prototype reduction on CIFAR10
Maintains classification accuracy on clean and adversarial data
Abstract
Deep neural networks are vulnerable to adversarial examples, i.e., carefully-crafted inputs that mislead classification at test time. Recent defenses have been shown to improve adversarial robustness by detecting anomalous deviations from legitimate training samples at different layer representations - a behavior normally exhibited by adversarial attacks. Despite technical differences, all aforementioned methods share a common backbone structure that we formalize and highlight in this contribution, as it can help in identifying promising research directions and drawbacks of existing methods. The first main contribution of this work is the review of these detection methods in the form of a unifying framework designed to accommodate both existing defenses and newer ones to come. In terms of drawbacks, the overmentioned defenses require comparing input samples against an oversized number…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
