WaterPool: A Watermark Mitigating Trade-offs among Imperceptibility, Efficacy and Robustness
Baizhou Huang, Xiaojun Wan

TL;DR
WaterPool introduces a novel key module for watermarking in large language models, effectively balancing imperceptibility, efficacy, and robustness, and improving existing methods without trade-offs.
Contribution
The paper proposes WaterPool, a new key module that enhances watermarking techniques by addressing trade-offs among imperceptibility, efficacy, and robustness in LLMs.
Findings
WaterPool significantly improves watermarking performance (+12.73% for KGW, +20.27% for EXP, +7.27% for ITS).
WaterPool preserves key sampling space for imperceptibility and uses semantics-based search for better key restoration.
WaterPool can be integrated with most watermarking methods as a plug-in.
Abstract
With the increasing use of large language models (LLMs) in daily life, concerns have emerged regarding their potential misuse and societal impact. Watermarking is proposed to trace the usage of specific models by injecting patterns into their generated texts. An ideal watermark should produce outputs that are nearly indistinguishable from those of the original LLM (imperceptibility), while ensuring a high detection rate (efficacy), even when the text is partially altered (robustness). Despite many methods having been proposed, none have simultaneously achieved all three properties, revealing an inherent trade-off. This paper utilizes a key-centered scheme to unify existing watermarking techniques by decomposing a watermark into two distinct modules: a key module and a mark module. Through this decomposition, we demonstrate for the first time that the key module significantly contributes…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Steganography and Watermarking Techniques · Internet Traffic Analysis and Secure E-voting
