An Undetectable Watermark for Generative Image Models
Sam Gunn, Xuandong Zhao, Dawn Song

TL;DR
This paper introduces the first undetectable watermarking scheme for generative image models that maintains image quality and is robust against removal attacks, enabling secure attribution of generated images.
Contribution
The authors propose a novel undetectable watermarking method for generative images using pseudorandom error-correcting codes, ensuring both undetectability and robustness.
Findings
Watermarks do not degrade image quality.
The scheme is robust against removal attacks.
It can encode up to 512 bits reliably.
Abstract
We present the first undetectable watermarking scheme for generative image models. Undetectability ensures that no efficient adversary can distinguish between watermarked and un-watermarked images, even after making many adaptive queries. In particular, an undetectable watermark does not degrade image quality under any efficiently computable metric. Our scheme works by selecting the initial latents of a diffusion model using a pseudorandom error-correcting code (Christ and Gunn, 2024), a strategy which guarantees undetectability and robustness. We experimentally demonstrate that our watermarks are quality-preserving and robust using Stable Diffusion 2.1. Our experiments verify that, in contrast to every prior scheme we tested, our watermark does not degrade image quality. Our experiments also demonstrate robustness: existing watermark removal attacks fail to remove our watermark from…
Peer Reviews
Decision·ICLR 2025 Poster
1. This paper provides an undetectable image watermark method, which is novel. 2. This paper provides sufficient theoretical and empirical support.
1. For Figure 2, this paper evaluates the undetectability of different watermarking methods. Could authors specify how many different keys other watermarking methods use? And if increasing the key numbers will increase the undetectability? 2. PRC.Encode_k is interesting, but it would be good for authors to explain how to sample a PRC codeword using PRC.Encode_k in more detail. It should be an important part of this paper, but only Algorithms 1 and 2 in the Appendix are not enough for the reade
1. This paper is well-written and easy to follow. 2. The proposed method maintains image quality by embedding watermarks without degrading metrics like FID, CLIP, and Inception Score. 3. The watermarking method can be seamlessly integrated into existing diffusion model frameworks.
1. The idea of modifying the initial noise for diffusion is not new. And this method changes initial noise a lot compared to Tree-Ring watermark. 2. This watermark is not as robust as current watermarks such as StegaStamp and Gaussian shading. 3. No significant improvements on metrics.
1. The paper introduces the first undetectable watermark. 2. The authors compare the proposed method with several watermarking baselines. 3. The robustness of the proposed method is evaluated with several perturbations.
1. The experiments are not adequate for supporting that the image quality of the proposed method outperforms existing ones. 2. The robuseness of the propsed method is not that good from the results. 3. The proposed watermarking method hugely change the image compared with the original one without watermark. 4. The evaluation for undetectability is not convincing.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Steganography and Watermarking Techniques · Chaos-based Image/Signal Encryption · Computer Graphics and Visualization Techniques
MethodsDiffusion
