Optimized Couplings for Watermarking Large Language Models
Dor Tsur, Carol Xuan Long, Claudio Mayrink Verdun, Hsiang Hsu, Haim Permuter, Flavio P. Calmon

TL;DR
This paper analyzes the fundamental trade-offs in watermarking large language models, proposing an optimal coupling strategy that balances detection accuracy and text quality preservation, with theoretical and numerical validation.
Contribution
It introduces a novel coupling-based watermarking scheme optimized for worst-case LLM distributions, providing closed-form detection rates and practical comparisons.
Findings
Optimal coupling strategy enhances watermark detection
Proposed scheme approaches theoretical detection limits
Numerical results validate effectiveness on synthetic and real data
Abstract
Large-language models (LLMs) are now able to produce text that is, in many cases, seemingly indistinguishable from human-generated content. This has fueled the development of watermarks that imprint a ``signal'' in LLM-generated text with minimal perturbation of an LLM's output. This paper provides an analysis of text watermarking in a one-shot setting. Through the lens of hypothesis testing with side information, we formulate and analyze the fundamental trade-off between watermark detection power and distortion in generated textual quality. We argue that a key component in watermark design is generating a coupling between the side information shared with the watermark detector and a random partition of the LLM vocabulary. Our analysis identifies the optimal coupling and randomization strategy under the worst-case LLM next-token distribution that satisfies a min-entropy constraint. We…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Steganography and Watermarking Techniques · Internet Traffic Analysis and Secure E-voting · Chaos-based Image/Signal Encryption
