Boosting Active Defense Persistence: A Two-Stage Defense Framework Combining Interruption and Poisoning Against Deepfake

Hongrui Zheng; Yuezun Li; Liejun Wang; Yunfeng Diao; Zhiqing Guo

arXiv:2508.07795·cs.CV·March 17, 2026

Boosting Active Defense Persistence: A Two-Stage Defense Framework Combining Interruption and Poisoning Against Deepfake

Hongrui Zheng, Yuezun Li, Liejun Wang, Yunfeng Diao, Zhiqing Guo

PDF

Open Access

TL;DR

This paper introduces a Two-Stage Defense Framework (TSDF) that combines interruption and poisoning techniques to create persistent active defenses against deepfake attacks, preventing attackers from retraining their models effectively.

Contribution

The paper proposes a novel two-stage defense framework using dual-function adversarial perturbations to both distort deepfakes and poison training data, enhancing defense persistence.

Findings

01

Traditional interruption methods degrade under adversarial retraining.

02

TSDF significantly improves defense persistence against model retraining.

03

Experimental results demonstrate strong dual defense capabilities.

Abstract

Active defense strategies have been developed to counter the threat of deepfake technology. However, a primary challenge is their lack of persistence, as their effectiveness is often short-lived. Attackers can bypass these defenses by simply collecting protected samples and retraining their models. This means that static defenses inevitably fail when attackers retrain their models, which severely limits practical use. We argue that an effective defense not only distorts forged content but also blocks the model's ability to adapt, which occurs when attackers retrain their models on protected images. To achieve this, we propose an innovative Two-Stage Defense Framework (TSDF). Benefiting from the intensity separation mechanism designed in this paper, the framework uses dual-function adversarial perturbations to perform two roles. First, it can directly distort the forged results. Second,…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdversarial Robustness in Machine Learning · Security and Verification in Computing · Advanced Malware Detection Techniques