PopSkipJump: Decision-Based Attack for Probabilistic Classifiers
Carl-Johann Simon-Gabriel, Noman Ahmed Sheikh, Andreas Krause

TL;DR
PopSkipJump is a novel decision-based adversarial attack tailored for probabilistic classifiers, effectively bypassing randomized defenses and maintaining query efficiency across different noise levels.
Contribution
It introduces a probabilistic extension of the HopSkipJump attack, enabling effective attacks on classifiers with stochastic outputs, which was not addressed by prior deterministic-focused methods.
Findings
Effective against state-of-the-art randomized defenses
Maintains query efficiency across noise levels
Successfully attacks probabilistic classifiers
Abstract
Most current classifiers are vulnerable to adversarial examples, small input perturbations that change the classification output. Many existing attack algorithms cover various settings, from white-box to black-box classifiers, but typically assume that the answers are deterministic and often fail when they are not. We therefore propose a new adversarial decision-based attack specifically designed for classifiers with probabilistic outputs. It is based on the HopSkipJump attack by Chen et al. (2019, arXiv:1904.02144v5 ), a strong and query efficient decision-based attack originally designed for deterministic classifiers. Our P(robabilisticH)opSkipJump attack adapts its amount of queries to maintain HopSkipJump's original output quality across various noise levels, while converging to its query efficiency as the noise level decreases. We test our attack on various noise models, including…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Taxonomy
TopicsAdversarial Robustness in Machine Learning · Explainable Artificial Intelligence (XAI) · Anomaly Detection Techniques and Applications
