Diffusion-Based Image Editing: An Unforeseen Adversary to Robust Invisible Watermarks

Wenkai Fu; Finn Carter; Yue Wang; Emily Davis; Bo Zhang

arXiv:2511.05598·cs.CR·November 11, 2025

Diffusion-Based Image Editing: An Unforeseen Adversary to Robust Invisible Watermarks

Wenkai Fu, Finn Carter, Yue Wang, Emily Davis, Bo Zhang

PDF

Open Access

TL;DR

This paper demonstrates that diffusion-based image editing models can effectively remove or distort robust invisible watermarks, exposing a fundamental vulnerability in current watermarking techniques against generative AI transformations.

Contribution

The paper provides a theoretical and empirical analysis of how diffusion models degrade watermarks and introduces diffusion-driven attacks that can erase watermarks with high visual fidelity.

Findings

01

Diffusion models can nearly eliminate watermark signals in images.

02

Diffusion-based attacks drastically reduce watermark decoding accuracy.

03

Current watermarking methods are vulnerable to generative model-based edits.

Abstract

Robust invisible watermarking aims to embed hidden messages into images such that they survive various manipulations while remaining imperceptible. However, powerful diffusion-based image generation and editing models now enable realistic content-preserving transformations that can inadvertently remove or distort embedded watermarks. In this paper, we present a theoretical and empirical analysis demonstrating that diffusion-based image editing can effectively break state-of-the-art robust watermarks designed to withstand conventional distortions. We analyze how the iterative noising and denoising process of diffusion models degrades embedded watermark signals, and provide formal proofs that under certain conditions a diffusion model's regenerated image retains virtually no detectable watermark information. Building on this insight, we propose a diffusion-driven attack that uses…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsGenerative Adversarial Networks and Image Synthesis · Advanced Steganography and Watermarking Techniques · Adversarial Robustness in Machine Learning