Detecting Adversarial Examples - A Lesson from Multimedia Forensics
Pascal Sch\"ottle, Alexander Schl\"ogl, Cecilia Pasquini, and Rainer, B\"ohme

TL;DR
This paper explores the parallels between multimedia forensics and adversarial example detection in machine learning, adapting steganalysis techniques to improve detection of adversarial images crafted by PGD, and demonstrates effectiveness on MNIST.
Contribution
It introduces a linear filter-based detection method for adversarial examples inspired by steganalysis, bridging multimedia forensics and adversarial machine learning.
Findings
The detection method reliably identifies PGD adversarial examples on MNIST.
Combining adversarial re-training with detection reduces attack success.
Steganalysis techniques can be effective in adversarial example detection.
Abstract
Adversarial classification is the task of performing robust classification in the presence of a strategic attacker. Originating from information hiding and multimedia forensics, adversarial classification recently received a lot of attention in a broader security context. In the domain of machine learning-based image classification, adversarial classification can be interpreted as detecting so-called adversarial examples, which are slightly altered versions of benign images. They are specifically crafted to be misclassified with a very high probability by the classifier under attack. Neural networks, which dominate among modern image classifiers, have been shown to be especially vulnerable to these adversarial examples. However, detecting subtle changes in digital images has always been the goal of multimedia forensics and steganalysis. In this paper, we highlight the parallels…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdversarial Robustness in Machine Learning · Digital Media Forensic Detection · Advanced Malware Detection Techniques
