Theoretically Principled Trade-off for Stateful Defenses against Query-Based Black-Box Attacks

Ashish Hooda; Neal Mangaokar; Ryan Feng; Kassem Fawaz; Somesh Jha; Atul Prakash

arXiv:2307.16331 · cs.LG · August 1, 2023 · 1 citation

Open Access

TL;DR

This paper provides a theoretical framework for understanding the fundamental trade-offs in stateful defenses against query-based black-box adversarial attacks, supported by empirical validation.

Contribution

It offers the first formal analysis of detection versus false positive trade-offs in stateful defenses, including upper bounds and impact on attack convergence.

Findings

01. Theoretical upper bounds for detection rates are established.

02. Trade-offs significantly influence attack success and false positive rates.

03. Empirical results validate the theoretical analysis across datasets.

Abstract

Adversarial examples threaten the integrity of machine learning systems with alarming success rates even under constrained black-box conditions. Stateful defenses have emerged as an effective countermeasure, detecting potential attacks by maintaining a buffer of recent queries and detecting new queries that are too similar. However, these defenses fundamentally pose a trade-off between attack detection and false positive rates, and this trade-off is typically optimized by hand-picking feature extractors and similarity thresholds that empirically work well. There is little current understanding as to the formal limits of this trade-off and the exact properties of the feature extractors/underlying problem domain that influence it. This work aims to address this gap by offering a theoretical characterization of the trade-off between detection and false positive rates for stateful defenses.…
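The buffer-and-threshold mechanism described in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the class name `StatefulDetector`, the parameters `threshold`, `buffer_size`, and `feature_fn`, and the choice of Euclidean distance are all assumptions made for illustration. The identity feature extractor used by default stands in for whatever extractor a real defense would choose.

```python
from collections import deque
import math


class StatefulDetector:
    """Toy stateful defense (hypothetical): flag queries too close to recent ones."""

    def __init__(self, threshold, buffer_size=1000, feature_fn=None):
        # Assumed parameters: a distance threshold and a bounded query buffer.
        self.threshold = threshold
        self.buffer = deque(maxlen=buffer_size)  # oldest entries are evicted first
        # Assumed default: the identity map stands in for a real feature extractor.
        self.feature_fn = feature_fn if feature_fn is not None else (lambda x: tuple(x))

    def check(self, query):
        """Return True if the query is within `threshold` of any buffered query."""
        feat = self.feature_fn(query)
        flagged = any(math.dist(feat, past) < self.threshold for past in self.buffer)
        # Design choice in this sketch: every query is stored, flagged or not.
        self.buffer.append(feat)
        return flagged


detector = StatefulDetector(threshold=0.1)
print(detector.check([0.0, 0.0]))   # False: the buffer is empty
print(detector.check([0.01, 0.0]))  # True: distance 0.01 is below the threshold
print(detector.check([1.0, 1.0]))   # False: far from both stored queries
```

In this sketch, raising `threshold` flags more near-duplicate queries but also more benign ones that happen to be similar, while lowering it does the reverse. This is the detection versus false positive trade-off that the paper sets out to characterize.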

Taxonomy

Topics: Adversarial Robustness in Machine Learning · Anomaly Detection Techniques and Applications · Explainable Artificial Intelligence (XAI)