DeMark: A Query-Free Black-Box Attack on Deepfake Watermarking Defenses

Wei Song; Zhenchang Xing; Liming Zhu; Yulei Sui; Jingling Xue

arXiv:2601.16473·cs.CR·January 26, 2026

DeMark: A Query-Free Black-Box Attack on Deepfake Watermarking Defenses

Wei Song, Zhenchang Xing, Liming Zhu, Yulei Sui, Jingling Xue

PDF

Open Access

TL;DR

DeMark is a novel query-free black-box attack that exploits latent-space vulnerabilities in deepfake watermarking schemes, significantly reducing detection accuracy while maintaining image quality, and exposing weaknesses in current defenses.

Contribution

This paper introduces DeMark, the first attack to target deepfake watermarking defenses without queries, revealing their vulnerabilities and challenging their assumed robustness.

Findings

01

DeMark reduces watermark detection accuracy from 100% to 32.9%.

02

Current defenses like super resolution and adversarial training are largely ineffective.

03

Latent-space manipulations can bypass existing watermarking schemes.

Abstract

The rapid proliferation of realistic deepfakes has raised urgent concerns over their misuse, motivating the use of defensive watermarks in synthetic images for reliable detection and provenance tracking. However, this defense paradigm assumes such watermarks are inherently resistant to removal. We challenge this assumption with DeMark, a query-free black-box attack framework that targets defensive image watermarking schemes for deepfakes. DeMark exploits latent-space vulnerabilities in encoder-decoder watermarking models through a compressive sensing based sparsification process, suppressing watermark signals while preserving perceptual and structural realism appropriate for deepfakes. Across eight state-of-the-art watermarking schemes, DeMark reduces watermark detection accuracy from 100% to 32.9% on average while maintaining natural visual quality, outperforming existing attacks. We…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdversarial Robustness in Machine Learning · Digital Media Forensic Detection · Generative Adversarial Networks and Image Synthesis