An Entropy-based Text Watermarking Detection Method
Yijian Lu, Aiwei Liu, Dianzhi Yu, Jingjing Li, Irwin King

TL;DR
This paper introduces EWD, an entropy-aware watermark detection method for LLM-generated texts that improves detection in low-entropy scenarios by weighting tokens according to their entropy, without requiring training.
Contribution
The paper proposes a novel entropy-based weighting scheme for watermark detection that enhances performance in low-entropy contexts and is fully automated and training-free.
Findings
EWD outperforms previous methods in low-entropy scenarios.
The method is general and adaptable to texts with varying entropy.
EWD is training-free and fully automated.
Abstract
Text watermarking algorithms for large language models (LLMs) can effectively identify machine-generated texts by embedding and detecting hidden features in the text. Although current text watermarking algorithms perform well in most high-entropy scenarios, their performance in low-entropy scenarios still needs to be improved. In this work, we opine that the influence of token entropy should be fully considered in the watermark detection process, i.e., the weight of each token during watermark detection should be customized according to its entropy, rather than setting the weights of all tokens to the same value as in previous methods. Specifically, we propose Entropy-based Text Watermarking Detection (EWD), which gives higher-entropy tokens higher influence weights during watermark detection, so as to better reflect the degree of watermarking.…
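To make the weighting idea in the abstract concrete, the following is a minimal sketch, not the authors' implementation. It assumes a green-list style watermark in which each token is already labeled green or red, a green-list ratio `gamma`, and Shannon entropy of the next-token distribution used directly as the token weight. The abstract does not specify the entropy measure or the weight function, so those choices, and the names `shannon_entropy` and `weighted_z_score`, are hypothetical.

```python
import math


def shannon_entropy(probs):
    """Shannon entropy (in nats) of one next-token distribution."""
    return -sum(p * math.log(p) for p in probs if p > 0)


def weighted_z_score(green_flags, weights, gamma=0.5):
    """z-score of a weighted green-token count under the no-watermark null.

    Assumption: without a watermark, each token is green independently with
    probability gamma, so sum(w_i * g_i) has mean gamma * sum(w_i) and
    variance gamma * (1 - gamma) * sum(w_i ** 2).
    """
    if len(green_flags) != len(weights):
        raise ValueError("green_flags and weights must have the same length")
    observed = sum(w for g, w in zip(green_flags, weights) if g)
    expected = gamma * sum(weights)
    variance = gamma * (1 - gamma) * sum(w * w for w in weights)
    if variance == 0:
        return 0.0
    return (observed - expected) / math.sqrt(variance)


# Toy text (made-up data): 4 high-entropy tokens that the watermark pushed
# into the green list, and 12 near-deterministic tokens that it could not
# influence, half of which are green by chance.
high = [0.25, 0.25, 0.25, 0.25]   # hypothetical next-token distribution
low = [0.97, 0.01, 0.01, 0.01]    # hypothetical next-token distribution
green_flags = [True] * 4 + [True, False] * 6
entropies = [shannon_entropy(high)] * 4 + [shannon_entropy(low)] * 12

# Equal weights reproduce the usual unweighted z-score.
z_uniform = weighted_z_score(green_flags, [1.0] * len(green_flags))
# Entropy weights let the high-entropy tokens dominate the statistic.
z_weighted = weighted_z_score(green_flags, entropies)
print(f"uniform: {z_uniform:.2f}, entropy-weighted: {z_weighted:.2f}")
```

In this toy case the entropy-weighted score is higher than the equal-weight score, because the low-entropy tokens, which carry little watermark signal, no longer dilute the evidence from the high-entropy ones.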
Taxonomy
Topics: Advanced Steganography and Watermarking Techniques · Vehicle License Plate Recognition · Biometric Identification and Security
