Ask, Attend, Attack: A Effective Decision-Based Black-Box Targeted   Attack for Image-to-Text Models

Qingyuan Zeng; Zhenzhong Wang; Yiu-ming Cheung; Min Jiang

arXiv:2408.08989·cs.AI·August 20, 2024

Ask, Attend, Attack: A Effective Decision-Based Black-Box Targeted Attack for Image-to-Text Models

Qingyuan Zeng, Zhenzhong Wang, Yiu-ming Cheung, Min Jiang

PDF

Open Access

TL;DR

This paper introduces a novel decision-based black-box targeted attack method for image-to-text models, utilizing a three-stage process to improve attack success while maintaining semantic integrity.

Contribution

The paper proposes the AAA framework, a new three-stage approach for decision-based black-box targeted attacks on image-to-text models, addressing semantic loss issues in prior methods.

Findings

01

Effective targeted attacks demonstrated on transformer-based models

02

Reduces semantic loss compared to gray-box attacks

03

Achieves high attack success rate in black-box setting

Abstract

While image-to-text models have demonstrated significant advancements in various vision-language tasks, they remain susceptible to adversarial attacks. Existing white-box attacks on image-to-text models require access to the architecture, gradients, and parameters of the target model, resulting in low practicality. Although the recently proposed gray-box attacks have improved practicality, they suffer from semantic loss during the training process, which limits their targeted attack performance. To advance adversarial attacks of image-to-text models, this paper focuses on a challenging scenario: decision-based black-box targeted attacks where the attackers only have access to the final output text and aim to perform targeted attacks. Specifically, we formulate the decision-based black-box targeted attack as a large-scale optimization problem. To efficiently solve the optimization…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdversarial Robustness in Machine Learning