How Real Is Real? A Human Evaluation Framework for Unrestricted Adversarial Examples
Dren Fazlija, Arkadij Orlov, Johanna Schrader, Monty-Maximilian, Z\"uhlke, Michael Rohs, Daniel Kudenko

TL;DR
This paper introduces SCOOTER, a human evaluation framework designed to assess the perceptual realism of unrestricted adversarial examples in images, addressing a gap in existing evaluation methods.
Contribution
The paper presents SCOOTER, a standardized, statistically rigorous human assessment framework for evaluating the perceptual quality of unrestricted adversarial images.
Findings
SCOOTER enables consistent human evaluation of adversarial image realism.
It provides guidelines and tools for conducting statistically significant experiments.
The framework helps determine if unrestricted attacks are truly imperceptible.
Abstract
With an ever-increasing reliance on machine learning (ML) models in the real world, adversarial examples threaten the safety of AI-based systems such as autonomous vehicles. In the image domain, they represent maliciously perturbed data points that look benign to humans (i.e., the image modification is not noticeable) but greatly mislead state-of-the-art ML models. Previously, researchers ensured the imperceptibility of their altered data points by restricting perturbations via norms. However, recent publications claim that creating natural-looking adversarial examples without such restrictions is also possible. With much more freedom to instill malicious information into data, these unrestricted adversarial examples can potentially overcome traditional defense strategies as they are not constrained by the limitations or patterns these defenses typically recognize and mitigate.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdversarial Robustness in Machine Learning · Advanced Malware Detection Techniques · Physical Unclonable Functions (PUFs) and Hardware Security
