TL;DR
This paper investigates adversarial attack detection methods for face recognition systems, focusing on attacks against models used as feature extractors, and demonstrates the effectiveness and generalizability of these detection techniques.
Contribution
It introduces a detection approach tested against classifier and deep feature attacks, showing its effectiveness and ability to generalize across attack types in face recognition systems.
Findings
Detection method effectively identifies classifier attacks.
Deep feature attacks are more effective than classifier attacks in fooling FR systems.
Detection approach generalizes to different attack types.
Abstract
Deep Learning methods have become state-of-the-art for solving tasks such as Face Recognition (FR). Unfortunately, despite their success, it has been pointed out that these learning models are exposed to adversarial inputs - images to which an imperceptible amount of noise for humans is added to maliciously fool a neural network - thus limiting their adoption in real-world applications. While it is true that an enormous effort has been spent in order to train robust models against this type of threat, adversarial detection techniques have recently started to draw attention within the scientific community. A detection approach has the advantage that it does not require to re-train any model, thus it can be added on top of any system. In this context, we present our work on adversarial samples detection in forensics mainly focused on detecting attacks against FR systems in which the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
