Adaptive Text Watermark for Large Language Models
Yepeng Liu, Yuheng Bu

TL;DR
This paper introduces an adaptive watermarking method for large language models that enhances text quality, security, and robustness without prior knowledge of prompts, using entropy-based token selection and semantic scaling.
Contribution
It proposes a novel adaptive watermarking strategy that dynamically adjusts watermark embedding based on token entropy and semantic context, improving over fixed methods.
Findings
Achieves comparable robustness to existing watermark techniques.
Maintains low perplexity similar to unwatermarked models.
Remains secure under various attack scenarios.
Abstract
The advancement of Large Language Models (LLMs) has led to increasing concerns about the misuse of AI-generated text, and watermarking for LLM-generated text has emerged as a potential solution. However, it is challenging to generate high-quality watermarked text while maintaining strong security, robustness, and the ability to detect watermarks without prior knowledge of the prompt or model. This paper proposes an adaptive watermarking strategy to address this problem. To improve the text quality and maintain robustness, we adaptively add watermarking to token distributions with high entropy measured using an auxiliary model and keep the low entropy token distributions untouched. For the sake of security and to further minimize the watermark's impact on text quality, instead of using a fixed green/red list generated from a random secret key, which can be vulnerable to decryption and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsInternet Traffic Analysis and Secure E-voting · Privacy-Preserving Technologies in Data · Topic Modeling
