TrojanEdit: Multimodal Backdoor Attack Against Image Editing Model
Ji Guo, Peihong Chen, Wenbo Jiang, Xiaolei Wen, Jiaming He, Jiachen Li, Guoming Lu, Aiguo Chen, and Hongwei Li

TL;DR
This paper introduces TrojanEdit, a novel backdoor attack framework for multimodal image editing models that effectively embeds triggers from both text and visual inputs, maintaining model functionality and achieving high attack success.
Contribution
It is the first to study backdoor attacks on multimodal diffusion-based image editing models and proposes a method to balance triggers from different modalities during training.
Findings
TrojanEdit achieves high attack success rates with balanced multimodal triggers.
The framework preserves the normal editing capabilities of the model.
It demonstrates effectiveness across multiple image editing models.
Abstract
Multimodal diffusion models for image editing generate outputs conditioned on both textual instructions and visual inputs, aiming to modify target regions while preserving the rest of the image. Although diffusion models have been shown to be vulnerable to backdoor attacks, existing efforts mainly focus on unimodal generative models and fail to address the unique challenges in multimodal image editing. In this paper, we present the first study of backdoor attacks on multimodal diffusion-based image editing models. We investigate the use of both textual and visual triggers to embed a backdoor that achieves high attack success rates while maintaining the model's normal functionality. However, we identify a critical modality bias. Simply combining triggers from different modalities leads the model to primarily rely on the stronger one, often the visual modality, which results in a loss of…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsCell Image Analysis Techniques · Advanced Steganography and Watermarking Techniques · Generative Adversarial Networks and Image Synthesis
