Less is More: Sparse Watermarking in LLMs with Enhanced Text Quality

Duy C. Hoang; Hung T. Q. Le; Rui Chu; Ping Li; Weijie Zhao; Yingjie; Lao; Khoa D. Doan

arXiv:2407.13803·cs.CR·July 22, 2024

Less is More: Sparse Watermarking in LLMs with Enhanced Text Quality

Duy C. Hoang, Hung T. Q. Le, Rui Chu, Ping Li, Weijie Zhao, Yingjie, Lao, Khoa D. Doan

PDF

Open Access 1 Repo

TL;DR

This paper introduces Sparse Watermarking for LLMs, which applies watermarks to select tokens based on POS tags, balancing detection effectiveness with high-quality text generation.

Contribution

It proposes a novel sparse watermarking method that improves text quality while maintaining high detectability compared to existing watermarking techniques.

Findings

01

High detectability of watermarks across tasks

02

Improved text quality over previous methods

03

Effective distribution of watermarked tokens

Abstract

With the widespread adoption of Large Language Models (LLMs), concerns about potential misuse have emerged. To this end, watermarking has been adapted to LLM, enabling a simple and effective way to detect and monitor generated text. However, while the existing methods can differentiate between watermarked and unwatermarked text with high accuracy, they often face a trade-off between the quality of the generated text and the effectiveness of the watermarking process. In this work, we present a novel type of LLM watermark, Sparse Watermark, which aims to mitigate this trade-off by applying watermarks to a small subset of generated tokens distributed across the text. The key strategy involves anchoring watermarked tokens to words that have specific Part-of-Speech (POS) tags. Our experimental results demonstrate that the proposed watermarking scheme achieves high detectability while…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

mail-research/sparse-llm-watermarking
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Steganography and Watermarking Techniques · Internet Traffic Analysis and Secure E-voting · Digital Rights Management and Security