An Undetectable Watermark for Generative Image Models

Sam Gunn; Xuandong Zhao; Dawn Song

arXiv:2410.07369·cs.CR·April 23, 2025

An Undetectable Watermark for Generative Image Models

Sam Gunn, Xuandong Zhao, Dawn Song

PDF

Open Access 1 Repo 3 Reviews

TL;DR

This paper introduces the first undetectable watermarking scheme for generative image models that maintains image quality and is robust against removal attacks, enabling secure attribution of generated images.

Contribution

The authors propose a novel undetectable watermarking method for generative images using pseudorandom error-correcting codes, ensuring both undetectability and robustness.

Findings

01

Watermarks do not degrade image quality.

02

The scheme is robust against removal attacks.

03

It can encode up to 512 bits reliably.

Abstract

We present the first undetectable watermarking scheme for generative image models. Undetectability ensures that no efficient adversary can distinguish between watermarked and un-watermarked images, even after making many adaptive queries. In particular, an undetectable watermark does not degrade image quality under any efficiently computable metric. Our scheme works by selecting the initial latents of a diffusion model using a pseudorandom error-correcting code (Christ and Gunn, 2024), a strategy which guarantees undetectability and robustness. We experimentally demonstrate that our watermarks are quality-preserving and robust using Stable Diffusion 2.1. Our experiments verify that, in contrast to every prior scheme we tested, our watermark does not degrade image quality. Our experiments also demonstrate robustness: existing watermark removal attacks fail to remove our watermark from…

Peer Reviews

Decision·ICLR 2025 Poster

Reviewer 01Rating 8Confidence 5

Strengths

1. This paper provides an undetectable image watermark method, which is novel. 2. This paper provides sufficient theoretical and empirical support.

Weaknesses

1. For Figure 2, this paper evaluates the undetectability of different watermarking methods. Could authors specify how many different keys other watermarking methods use? And if increasing the key numbers will increase the undetectability? 2. PRC.Encode_k is interesting, but it would be good for authors to explain how to sample a PRC codeword using PRC.Encode_k in more detail. It should be an important part of this paper, but only Algorithms 1 and 2 in the Appendix are not enough for the reade

Reviewer 02Rating 6Confidence 4

Strengths

1. This paper is well-written and easy to follow. 2. The proposed method maintains image quality by embedding watermarks without degrading metrics like FID, CLIP, and Inception Score. 3. The watermarking method can be seamlessly integrated into existing diffusion model frameworks.

Weaknesses

1. The idea of modifying the initial noise for diffusion is not new. And this method changes initial noise a lot compared to Tree-Ring watermark. 2. This watermark is not as robust as current watermarks such as StegaStamp and Gaussian shading. 3. No significant improvements on metrics.

Reviewer 03Rating 6Confidence 4

Strengths

1. The paper introduces the first undetectable watermark. 2. The authors compare the proposed method with several watermarking baselines. 3. The robustness of the proposed method is evaluated with several perturbations.

Weaknesses

1. The experiments are not adequate for supporting that the image quality of the proposed method outperforms existing ones. 2. The robuseness of the propsed method is not that good from the results. 3. The proposed watermarking method hugely change the image compared with the original one without watermark. 4. The evaluation for undetectability is not convincing.

Code & Models

Repositories

xuandongzhao/prc-watermark
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Steganography and Watermarking Techniques · Chaos-based Image/Signal Encryption · Computer Graphics and Visualization Techniques

MethodsDiffusion