Unveiling Vulnerabilities in Interpretable Deep Learning Systems with   Query-Efficient Black-box Attacks

Eldor Abdukhamidov; Mohammed Abuhamad; Simon S. Woo; Eric Chan-Tin,; Tamer Abuhmed

arXiv:2307.11906·cs.CV·July 25, 2023·1 cites

Unveiling Vulnerabilities in Interpretable Deep Learning Systems with Query-Efficient Black-box Attacks

Eldor Abdukhamidov, Mohammed Abuhamad, Simon S. Woo, Eric Chan-Tin,, Tamer Abuhmed

PDF

Open Access

TL;DR

This paper introduces a novel query-efficient black-box attack method using microbial genetic algorithms to expose vulnerabilities in Interpretable Deep Learning Systems, demonstrating high success rates and challenging detection.

Contribution

It presents a new microbial genetic algorithm-based attack that requires no prior knowledge, combining transfer and score-based methods to effectively attack IDLSes.

Findings

01

High attack success rates achieved

02

Adversarial examples have attribution maps similar to benign samples

03

Attacks are difficult to detect even by humans

Abstract

Deep learning has been rapidly employed in many applications revolutionizing many industries, but it is known to be vulnerable to adversarial attacks. Such attacks pose a serious threat to deep learning-based systems compromising their integrity, reliability, and trust. Interpretable Deep Learning Systems (IDLSes) are designed to make the system more transparent and explainable, but they are also shown to be susceptible to attacks. In this work, we propose a novel microbial genetic algorithm-based black-box attack against IDLSes that requires no prior knowledge of the target model and its interpretation model. The proposed attack is a query-efficient approach that combines transfer-based and score-based methods, making it a powerful tool to unveil IDLS vulnerabilities. Our experiments of the attack show high attack success rates using adversarial examples with attribution maps that are…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdversarial Robustness in Machine Learning · Explainable Artificial Intelligence (XAI)