Multi-use LLM Watermarking and the False Detection Problem
Zihao Fu, Chris Russell

TL;DR
This paper addresses the false detection problem in LLM watermarking by proposing a dual watermarking method that jointly encodes detection and identification, reducing false positives and maintaining accuracy.
Contribution
The paper introduces a novel dual watermarking approach that combines detection and user identification, solving the false detection issue in LLM watermarking.
Findings
Dual watermarking reduces false positive rates significantly.
Theoretical analysis explains the causes of false detection.
Experimental results confirm the effectiveness of the proposed method.
Abstract
Digital watermarking is a promising solution for mitigating some of the risks arising from the misuse of automatically generated text. These approaches either embed non-specific watermarks to allow for the detection of any text generated by a particular sampler, or embed specific keys that allow the identification of the LLM user. However, simultaneously using the same embedding for both detection and user identification leads to a false detection problem, whereby, as user capacity grows, unwatermarked text is increasingly likely to be falsely detected as watermarked. Through theoretical analysis, we identify the underlying causes of this phenomenon. Building on these insights, we propose Dual Watermarking which jointly encodes detection and identification watermarks into generated text, significantly reducing false positives while maintaining high detection accuracy. Our experimental…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Steganography and Watermarking Techniques · Digital Media Forensic Detection · Physical Unclonable Functions (PUFs) and Hardware Security
