Shallow Diffuse: Robust and Invisible Watermarking through Low-Dimensional Subspaces in Diffusion Models
Wenda Li, Huijie Zhang, Qing Qu

TL;DR
Shallow Diffuse introduces a novel watermarking method for diffusion models that embeds invisible, robust watermarks by exploiting low-dimensional subspaces, improving detection and consistency of AI-generated images.
Contribution
It proposes a decoupled watermarking technique leveraging low-dimensional subspaces in diffusion models, enhancing robustness and invisibility compared to prior methods.
Findings
Outperforms existing watermarking methods in robustness.
Enhances watermark detectability and image generation consistency.
Theoretical and empirical validation supports effectiveness.
Abstract
The widespread use of AI-generated content from diffusion models has raised significant concerns regarding misinformation and copyright infringement. Watermarking is a crucial technique for identifying these AI-generated images and preventing their misuse. In this paper, we introduce Shallow Diffuse, a new watermarking technique that embeds robust and invisible watermarks into diffusion model outputs. Unlike existing approaches that integrate watermarking throughout the entire diffusion sampling process, Shallow Diffuse decouples these steps by leveraging the presence of a low-dimensional subspace in the image generation process. This method ensures that a substantial portion of the watermark lies in the null space of this subspace, effectively separating it from the image generation process. Our theoretical and empirical analyses show that this decoupling strategy greatly enhances the…
Peer Reviews
Decision·Submitted to ICLR 2025
1. Shallow Diffuse's primary strength lies in its utilization of the low-rank property of the PMP's Jacobian matrix to minimize the visual impact of watermarks, thereby attaining visual consistency within a training-free watermark framework. 2. The injected watermark is more robust against several image distortions than existing baselines.
1. The presentation of this paper is poor, for instance, the ablation studies (Appendix C) and index of experimental results (Table 4) are incomplete. Therefore, this leads to a shortage of critical ablation studies. 2. What is the performance of multi-key identification, specifically, is it possible for the Shallow Diffuse to inject multiple watermarks and distinguish between them? 3. The image distortions are less than that in previous studies, such as Tree-Ring, where they apply 6 distortions
The originality of this paper is not great, but its quality, clarity and significance are good. It has the support of rich theoretical basis and has advantages in theoretical proof.
The table in the paper is not very well drawn, it is very difficult to read, especially the header. At the same time, the experimental part is not detailed enough. For example, should the comparison method reproduce the results or use the pre-training model?
1. The proposed method has smaller impact on the generated images compared to the existing watermarking methods designed for diffusion models. 2. Experiments are carried on several image-prompt datasets to show the effectiveness of the proposed methods. 3. The robustness of the propsoed method is evaluated.
1. Compared to Tree-Ring, the technical contribution of the proposed method is limited. 2. In the experimental part, the authors mainly compare their method with the watermarking methods that embed watermark into the semantic space like Tree-Ring which changes the image a lot. More other watermarking methods should be evaluated. 3. In the robustness part, the authors only evaluate the robustness of the proposed method on some common perturbation.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Steganography and Watermarking Techniques · Internet Traffic Analysis and Secure E-voting
MethodsDiffusion
