How Real Is Real? A Human Evaluation Framework for Unrestricted   Adversarial Examples

Dren Fazlija; Arkadij Orlov; Johanna Schrader; Monty-Maximilian; Z\"uhlke; Michael Rohs; Daniel Kudenko

arXiv:2404.12653·cs.AI·April 22, 2024

How Real Is Real? A Human Evaluation Framework for Unrestricted Adversarial Examples

Dren Fazlija, Arkadij Orlov, Johanna Schrader, Monty-Maximilian, Z\"uhlke, Michael Rohs, Daniel Kudenko

PDF

Open Access

TL;DR

This paper introduces SCOOTER, a human evaluation framework designed to assess the perceptual realism of unrestricted adversarial examples in images, addressing a gap in existing evaluation methods.

Contribution

The paper presents SCOOTER, a standardized, statistically rigorous human assessment framework for evaluating the perceptual quality of unrestricted adversarial images.

Findings

01

SCOOTER enables consistent human evaluation of adversarial image realism.

02

It provides guidelines and tools for conducting statistically significant experiments.

03

The framework helps determine if unrestricted attacks are truly imperceptible.

Abstract

With an ever-increasing reliance on machine learning (ML) models in the real world, adversarial examples threaten the safety of AI-based systems such as autonomous vehicles. In the image domain, they represent maliciously perturbed data points that look benign to humans (i.e., the image modification is not noticeable) but greatly mislead state-of-the-art ML models. Previously, researchers ensured the imperceptibility of their altered data points by restricting perturbations via $ℓ_{p}$ norms. However, recent publications claim that creating natural-looking adversarial examples without such restrictions is also possible. With much more freedom to instill malicious information into data, these unrestricted adversarial examples can potentially overcome traditional defense strategies as they are not constrained by the limitations or patterns these defenses typically recognize and mitigate.…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdversarial Robustness in Machine Learning · Advanced Malware Detection Techniques · Physical Unclonable Functions (PUFs) and Hardware Security