AdvQDet: Detecting Query-Based Adversarial Attacks with Adversarial   Contrastive Prompt Tuning

Xin Wang; Kai Chen; Xingjun Ma; Zhineng Chen; Jingjing Chen; Yu-Gang; Jiang

arXiv:2408.01978·cs.CV·August 6, 2024

AdvQDet: Detecting Query-Based Adversarial Attacks with Adversarial Contrastive Prompt Tuning

Xin Wang, Kai Chen, Xingjun Ma, Zhineng Chen, Jingjing Chen, Yu-Gang, Jiang

PDF

Open Access 1 Repo

TL;DR

This paper introduces AdvQDet, a framework that effectively detects query-based adversarial attacks on neural networks using a novel contrastive prompt tuning method to identify similar adversarial examples with high accuracy and efficiency.

Contribution

The paper proposes ACPT, a new contrastive prompt tuning approach for robustly fine-tuning CLIP to detect intermediate adversarial queries, and introduces AdvQDet, a detection framework that outperforms existing methods.

Findings

01

Detects 7 state-of-the-art query-based attacks with over 99% accuracy within 5 queries.

02

ACPT is robust against 3 types of adaptive attacks.

03

Achieves high detection efficiency with minimal queries.

Abstract

Deep neural networks (DNNs) are known to be vulnerable to adversarial attacks even under a black-box setting where the adversary can only query the model. Particularly, query-based black-box adversarial attacks estimate adversarial gradients based on the returned probability vectors of the target model for a sequence of queries. During this process, the queries made to the target model are intermediate adversarial examples crafted at the previous attack step, which share high similarities in the pixel space. Motivated by this observation, stateful detection methods have been proposed to detect and reject query-based attacks. While demonstrating promising results, these methods either have been evaded by more advanced attacks or suffer from low efficiency in terms of the number of shots (queries) required to detect different attacks. Arguably, the key challenge here is to assign high…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

xinwong/advqdet
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdversarial Robustness in Machine Learning · Anomaly Detection Techniques and Applications · Network Security and Intrusion Detection

MethodsContrastive Language-Image Pre-training