Detecting Adversarial Examples - A Lesson from Multimedia Forensics

Pascal Sch\"ottle; Alexander Schl\"ogl; Cecilia Pasquini; and Rainer; B\"ohme

arXiv:1803.03613·cs.CR·March 12, 2018·5 cites

Detecting Adversarial Examples - A Lesson from Multimedia Forensics

Pascal Sch\"ottle, Alexander Schl\"ogl, Cecilia Pasquini, and Rainer, B\"ohme

PDF

Open Access

TL;DR

This paper explores the parallels between multimedia forensics and adversarial example detection in machine learning, adapting steganalysis techniques to improve detection of adversarial images crafted by PGD, and demonstrates effectiveness on MNIST.

Contribution

It introduces a linear filter-based detection method for adversarial examples inspired by steganalysis, bridging multimedia forensics and adversarial machine learning.

Findings

01

The detection method reliably identifies PGD adversarial examples on MNIST.

02

Combining adversarial re-training with detection reduces attack success.

03

Steganalysis techniques can be effective in adversarial example detection.

Abstract

Adversarial classification is the task of performing robust classification in the presence of a strategic attacker. Originating from information hiding and multimedia forensics, adversarial classification recently received a lot of attention in a broader security context. In the domain of machine learning-based image classification, adversarial classification can be interpreted as detecting so-called adversarial examples, which are slightly altered versions of benign images. They are specifically crafted to be misclassified with a very high probability by the classifier under attack. Neural networks, which dominate among modern image classifiers, have been shown to be especially vulnerable to these adversarial examples. However, detecting subtle changes in digital images has always been the goal of multimedia forensics and steganalysis. In this paper, we highlight the parallels…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdversarial Robustness in Machine Learning · Digital Media Forensic Detection · Advanced Malware Detection Techniques