Watermark-based Attribution of AI-Generated Content
Zhengyuan Jiang, Moyang Guo, Yuepeng Hu, Yupu Wang, Neil Zhenqiang Gong

TL;DR
This paper introduces a systematic approach for user-level attribution of AI-generated content using unique watermarks, combining theoretical analysis and empirical validation to optimize detection performance and robustness.
Contribution
It presents the first comprehensive study on watermark-based user attribution for AI content, including theoretical bounds and an optimized watermark selection strategy.
Findings
Attribution remains accurate under common post-processing techniques.
Watermark-based attribution is robust against limited-query black-box attacks.
Theoretical bounds guide optimal watermark selection for improved attribution.
Abstract
Several companies have deployed watermark-based detection to identify AI-generated content. However, attribution--the ability to trace back to the user of a generative AI (GenAI) service who created a given AI-generated content--remains largely unexplored despite its growing importance. In this work, we aim to bridge this gap by conducting the first systematic study on watermark-based, user-level attribution of AI-generated content. Our key idea is to assign a unique watermark to each user of the GenAI service and embed this watermark into the AI-generated content created by that user. Attribution is then performed by identifying the user whose watermark best matches the one extracted from the given content. This approach, however, faces a key challenge: How should watermarks be selected for users to maximize attribution performance? To address the challenge, we first theoretically…
Peer Reviews
Decision·ICLR 2026 Poster
- The paper looks at watermark detection and specifically attribution to the users, which is under explored in the literature. - The paper provides theoretical analysis on the upper and lower bounds on the detection rate. - Empirical results seem to suggest that the proposed method preserves utility and robustness.
- The motivation of attribution of ai-generated images to user is unclear. I can understand why detection of ai-generated images is important and that is a main motivation of inserting watermark for ai generated content. But it's unclear to me why attribution to users is an important, practical concern. Some discussion on this would be important. - Section 7.1, comparison with non user specific watermark is missing. It would be interesting to study whether such robustness to post-processing is p
Pros: In practical scenarios, provenance of data and its attribution can be assigned to a user.
Cons: 1. The paper does not discuss privacy at all. Such attribution discloses privacy and thus other sensitive information. The watermarks can lead to replay attacks - malicious users can attribute the watermark of one user to an image that they did not generate. 2. The paper does not look into simple mechanisms - such as hashing schemes - why such hashes of enough size not be used as watermarks? There have been work in security community. So the watermark selection problem for scalability ac
The paper presents a cool idea, and it is well written. The proposed idea seems to be easily implemented. Theoretical results are provided to support the numerical findings.
The definition of tau is not intuitive. How to select tau seems to be a tricky question -- an absolutely large tau may rule out the possibility of detecting the watermark, whereas a small tau may render multiple i's satisfying the TDR condition, making the selection of i as in the TAR stage sensitive to ties and run-ups. This becomes more challenging when the dimension of watermark is long (e.g., 64-bit, as mentioned by the authors). The results seem to show that when tau increases, both TDR and
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Steganography and Watermarking Techniques · Image Processing and 3D Reconstruction · Digital and Cyber Forensics
Methodstravel james · Sparse Evolutionary Training
