Watermark-based Attribution of AI-Generated Content

Zhengyuan Jiang; Moyang Guo; Yuepeng Hu; Yupu Wang; Neil Zhenqiang Gong

arXiv:2404.04254·cs.CR·January 28, 2026·6 cites

Watermark-based Attribution of AI-Generated Content

Zhengyuan Jiang, Moyang Guo, Yuepeng Hu, Yupu Wang, Neil Zhenqiang Gong

PDF

Open Access 1 Repo 3 Reviews

TL;DR

This paper introduces a systematic approach for user-level attribution of AI-generated content using unique watermarks, combining theoretical analysis and empirical validation to optimize detection performance and robustness.

Contribution

It presents the first comprehensive study on watermark-based user attribution for AI content, including theoretical bounds and an optimized watermark selection strategy.

Findings

01

Attribution remains accurate under common post-processing techniques.

02

Watermark-based attribution is robust against limited-query black-box attacks.

03

Theoretical bounds guide optimal watermark selection for improved attribution.

Abstract

Several companies have deployed watermark-based detection to identify AI-generated content. However, attribution--the ability to trace back to the user of a generative AI (GenAI) service who created a given AI-generated content--remains largely unexplored despite its growing importance. In this work, we aim to bridge this gap by conducting the first systematic study on watermark-based, user-level attribution of AI-generated content. Our key idea is to assign a unique watermark to each user of the GenAI service and embed this watermark into the AI-generated content created by that user. Attribution is then performed by identifying the user whose watermark best matches the one extracted from the given content. This approach, however, faces a key challenge: How should watermarks be selected for users to maximize attribution performance? To address the challenge, we first theoretically…

Peer Reviews

Decision·ICLR 2026 Poster

Reviewer 01Rating 6Confidence 3

Strengths

- The paper looks at watermark detection and specifically attribution to the users, which is under explored in the literature. - The paper provides theoretical analysis on the upper and lower bounds on the detection rate. - Empirical results seem to suggest that the proposed method preserves utility and robustness.

Weaknesses

- The motivation of attribution of ai-generated images to user is unclear. I can understand why detection of ai-generated images is important and that is a main motivation of inserting watermark for ai generated content. But it's unclear to me why attribution to users is an important, practical concern. Some discussion on this would be important. - Section 7.1, comparison with non user specific watermark is missing. It would be interesting to study whether such robustness to post-processing is p

Reviewer 02Rating 2Confidence 4

Strengths

Pros: In practical scenarios, provenance of data and its attribution can be assigned to a user.

Weaknesses

Cons: 1. The paper does not discuss privacy at all. Such attribution discloses privacy and thus other sensitive information. The watermarks can lead to replay attacks - malicious users can attribute the watermark of one user to an image that they did not generate. 2. The paper does not look into simple mechanisms - such as hashing schemes - why such hashes of enough size not be used as watermarks? There have been work in security community. So the watermark selection problem for scalability ac

Reviewer 03Rating 6Confidence 4

Strengths

The paper presents a cool idea, and it is well written. The proposed idea seems to be easily implemented. Theoretical results are provided to support the numerical findings.

Weaknesses

The definition of tau is not intuitive. How to select tau seems to be a tricky question -- an absolutely large tau may rule out the possibility of detecting the watermark, whereas a small tau may render multiple i's satisfying the TDR condition, making the selection of i as in the TAR stage sensitive to ties and run-ups. This becomes more challenging when the dimension of watermark is long (e.g., 64-bit, as mentioned by the authors). The results seem to show that when tau increases, both TDR and

Code & Models

Repositories

zhengyuan-jiang/Watermark-Library
pytorch

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Steganography and Watermarking Techniques · Image Processing and 3D Reconstruction · Digital and Cyber Forensics

Methodstravel james · Sparse Evolutionary Training