Editing Away the Evidence: Diffusion-Based Image Manipulation and the Failure Modes of Robust Watermarking

Qian Qi; Jiangyun Tang; Jim Lee; Emily Davis; Finn Carter

arXiv:2603.12949·eess.IV·March 16, 2026

Editing Away the Evidence: Diffusion-Based Image Manipulation and the Failure Modes of Robust Watermarking

Qian Qi, Jiangyun Tang, Jim Lee, Emily Davis, Finn Carter

PDF

Open Access

TL;DR

This paper analyzes how diffusion-based image editing can unintentionally remove or weaken robust watermarks, challenging their reliability for content protection and proposing principles for more resilient watermarking methods.

Contribution

It provides a unified theoretical and empirical framework to understand the failure modes of watermarking under diffusion editing and offers insights for designing more robust schemes.

Findings

01

Diffusion editing can significantly degrade watermark signals.

02

Theoretical bounds show conditions where watermark recovery is impossible.

03

Routine edits often reduce watermark detectability.

Abstract

Robust invisible watermarks are widely used to support copyright protection, content provenance, and accountability by embedding hidden signals designed to survive common post-processing operations. However, diffusion-based image editing introduces a fundamentally different class of transformations: it injects noise and reconstructs images through a powerful generative prior, often altering semantic content while preserving photorealism. In this paper, we provide a unified theoretical and empirical analysis showing that non-adversarial diffusion editing can unintentionally degrade or remove robust watermarks. We model diffusion editing as a stochastic transformation that progressively contracts off-manifold perturbations, causing the low-amplitude signals used by many watermarking schemes to decay. Our analysis derives bounds on watermark signal-to-noise ratio and mutual information…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Steganography and Watermarking Techniques · Digital Media Forensic Detection · Generative Adversarial Networks and Image Synthesis