Stealthy Targeted Backdoor Attacks against Image Captioning

Wenshu Fan; Hongwei Li; Wenbo Jiang; Meng Hao; Shui Yu; Xiao Zhang

arXiv:2406.05874·cs.CR·June 11, 2024

Stealthy Targeted Backdoor Attacks against Image Captioning

Wenshu Fan, Hongwei Li, Wenbo Jiang, Meng Hao, Shui Yu, Xiao Zhang

PDF

Open Access 1 Repo

TL;DR

This paper introduces a stealthy backdoor attack method on image captioning models that manipulates object recognition to produce targeted captions, achieving high success rates without affecting normal performance.

Contribution

The paper proposes a novel, stealthier backdoor attack technique that leverages universal perturbations and object placement to manipulate image captions.

Findings

01

High attack success rate achieved

02

Backdoor samples are indistinguishable from clean samples

03

Method bypasses existing defenses

Abstract

In recent years, there has been an explosive growth in multimodal learning. Image captioning, a classical multimodal task, has demonstrated promising applications and attracted extensive research attention. However, recent studies have shown that image caption models are vulnerable to some security threats such as backdoor attacks. Existing backdoor attacks against image captioning typically pair a trigger either with a predefined sentence or a single word as the targeted output, yet they are unrelated to the image content, making them easily noticeable as anomalies by humans. In this paper, we present a novel method to craft targeted backdoor attacks against image caption models, which are designed to be stealthier than prior attacks. Specifically, our method first learns a special trigger by leveraging universal perturbation techniques for object detection, then places the learned…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

fiora6/icbackdoor
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMultimodal Machine Learning Applications · COVID-19 diagnosis using AI · Human Pose and Action Recognition