TrojanEdit: Multimodal Backdoor Attack Against Image Editing Model

Ji Guo; Peihong Chen; Wenbo Jiang; Xiaolei Wen; Jiaming He; Jiachen Li; Guoming Lu; Aiguo Chen; and Hongwei Li

arXiv:2411.14681·cs.CR·June 2, 2025

TrojanEdit: Multimodal Backdoor Attack Against Image Editing Model

Ji Guo, Peihong Chen, Wenbo Jiang, Xiaolei Wen, Jiaming He, Jiachen Li, Guoming Lu, Aiguo Chen, and Hongwei Li

PDF

Open Access

TL;DR

This paper introduces TrojanEdit, a novel backdoor attack framework for multimodal image editing models that effectively embeds triggers from both text and visual inputs, maintaining model functionality and achieving high attack success.

Contribution

It is the first to study backdoor attacks on multimodal diffusion-based image editing models and proposes a method to balance triggers from different modalities during training.

Findings

01

TrojanEdit achieves high attack success rates with balanced multimodal triggers.

02

The framework preserves the normal editing capabilities of the model.

03

It demonstrates effectiveness across multiple image editing models.

Abstract

Multimodal diffusion models for image editing generate outputs conditioned on both textual instructions and visual inputs, aiming to modify target regions while preserving the rest of the image. Although diffusion models have been shown to be vulnerable to backdoor attacks, existing efforts mainly focus on unimodal generative models and fail to address the unique challenges in multimodal image editing. In this paper, we present the first study of backdoor attacks on multimodal diffusion-based image editing models. We investigate the use of both textual and visual triggers to embed a backdoor that achieves high attack success rates while maintaining the model's normal functionality. However, we identify a critical modality bias. Simply combining triggers from different modalities leads the model to primarily rely on the stronger one, often the visual modality, which results in a loss of…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsCell Image Analysis Techniques · Advanced Steganography and Watermarking Techniques · Generative Adversarial Networks and Image Synthesis