Adaptive Text Watermark for Large Language Models

Yepeng Liu; Yuheng Bu

arXiv:2401.13927·cs.CL·June 11, 2024·2 cites

Adaptive Text Watermark for Large Language Models

Yepeng Liu, Yuheng Bu

PDF

Open Access 1 Repo

TL;DR

This paper introduces an adaptive watermarking method for large language models that enhances text quality, security, and robustness without prior knowledge of prompts, using entropy-based token selection and semantic scaling.

Contribution

It proposes a novel adaptive watermarking strategy that dynamically adjusts watermark embedding based on token entropy and semantic context, improving over fixed methods.

Findings

01

Achieves comparable robustness to existing watermark techniques.

02

Maintains low perplexity similar to unwatermarked models.

03

Remains secure under various attack scenarios.

Abstract

The advancement of Large Language Models (LLMs) has led to increasing concerns about the misuse of AI-generated text, and watermarking for LLM-generated text has emerged as a potential solution. However, it is challenging to generate high-quality watermarked text while maintaining strong security, robustness, and the ability to detect watermarks without prior knowledge of the prompt or model. This paper proposes an adaptive watermarking strategy to address this problem. To improve the text quality and maintain robustness, we adaptively add watermarking to token distributions with high entropy measured using an auxiliary model and keep the low entropy token distributions untouched. For the sake of security and to further minimize the watermark's impact on text quality, instead of using a fixed green/red list generated from a random secret key, which can be vulnerable to decryption and…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

yepengliu/adaptive-text-watermark
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsInternet Traffic Analysis and Secure E-voting · Privacy-Preserving Technologies in Data · Topic Modeling