SCOOTER: A Human Evaluation Framework for Unrestricted Adversarial Examples
Dren Fazlija, Monty-Maximilian Z\"uhlke, Johanna Schrader, Arkadij Orlov, Clara Stein, Iyiola E. Olatunji, Daniel Kudenko

TL;DR
SCOOTER is an open-source framework designed to evaluate the imperceptibility of unrestricted adversarial examples through large-scale human studies and statistical analysis.
Contribution
It introduces best-practice guidelines, software tools, and a benchmark dataset for assessing and comparing the human perceptual quality of adversarial attacks.
Findings
Three attack methods failed to produce imperceptible images according to human evaluation.
GPT-4o can preliminarily detect some adversarial examples but is not fully reliable.
Automated vision systems do not align well with human perception, highlighting the need for ground-truth benchmarks.
Abstract
Unrestricted adversarial attacks aim to fool computer vision models without being constrained by -norm bounds to remain imperceptible to humans, for example, by changing an object's color. This allows attackers to circumvent traditional, norm-bounded defense strategies such as adversarial training or certified defense strategies. However, due to their unrestricted nature, there are also no guarantees of norm-based imperceptibility, necessitating human evaluations to verify just how authentic these adversarial examples look. While some related work assesses this vital quality of adversarial attacks, none provide statistically significant insights. This issue necessitates a unified framework that supports and streamlines such an assessment for evaluating and comparing unrestricted attacks. To close this gap, we introduce SCOOTER - an open-source, statistically powered framework…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
