Towards Transferable Defense Against Malicious Image Edits

Jie Zhang; Shuai Dong; Shiguang Shan; Xilin Chen

arXiv:2512.14341·cs.CV·March 3, 2026

Towards Transferable Defense Against Malicious Image Edits

Jie Zhang, Shuai Dong, Shiguang Shan, Xilin Chen

PDF

Open Access

TL;DR

This paper introduces TDAE, a bimodal framework combining visual and textual defenses to improve the transferability and robustness of images against malicious edits in diffusion-based systems.

Contribution

The paper proposes TDAE, a novel bimodal defense framework with FDM and DPD, enhancing cross-model immunity against malicious image manipulations.

Findings

01

TDAE outperforms existing methods in cross-model evaluations.

02

FDM effectively regularizes gradients for robustness.

03

DPD improves transferability through iterative embedding refinement.

Abstract

Recent approaches employing imperceptible perturbations in input images have demonstrated promising potential to counter malicious manipulations in diffusion-based image editing systems. However, existing methods suffer from limited transferability in cross-model evaluations. To address this, we propose Transferable Defense Against Malicious Image Edits (TDAE), a novel bimodal framework that enhances image immunity against malicious edits through coordinated image-text optimization. Specifically, at the visual defense level, we introduce FlatGrad Defense Mechanism (FDM), which incorporates gradient regularization into the adversarial objective. By explicitly steering the perturbations toward flat minima, FDM amplifies immune robustness against unseen editing models. For textual enhancement protection, we propose an adversarial optimization paradigm named Dynamic Prompt Defense (DPD),…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsGenerative Adversarial Networks and Image Synthesis · Digital Media Forensic Detection · Advanced Image Processing Techniques