Editing Away the Evidence: Diffusion-Based Image Manipulation and the Failure Modes of Robust Watermarking
Qian Qi, Jiangyun Tang, Jim Lee, Emily Davis, Finn Carter

TL;DR
This paper analyzes how diffusion-based image editing can unintentionally remove or weaken robust watermarks, challenging their reliability for content protection and proposing principles for more resilient watermarking methods.
Contribution
It provides a unified theoretical and empirical framework to understand the failure modes of watermarking under diffusion editing and offers insights for designing more robust schemes.
Findings
Diffusion editing can significantly degrade watermark signals.
Theoretical bounds show conditions where watermark recovery is impossible.
Routine edits often reduce watermark detectability.
Abstract
Robust invisible watermarks are widely used to support copyright protection, content provenance, and accountability by embedding hidden signals designed to survive common post-processing operations. However, diffusion-based image editing introduces a fundamentally different class of transformations: it injects noise and reconstructs images through a powerful generative prior, often altering semantic content while preserving photorealism. In this paper, we provide a unified theoretical and empirical analysis showing that non-adversarial diffusion editing can unintentionally degrade or remove robust watermarks. We model diffusion editing as a stochastic transformation that progressively contracts off-manifold perturbations, causing the low-amplitude signals used by many watermarking schemes to decay. Our analysis derives bounds on watermark signal-to-noise ratio and mutual information…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Steganography and Watermarking Techniques · Digital Media Forensic Detection · Generative Adversarial Networks and Image Synthesis
