Provably Robust Multi-bit Watermarking for AI-generated Text

Wenjie Qu; Wengrui Zheng; Tianyang Tao; Dong Yin; Yanze Jiang; Zhihua; Tian; Wei Zou; Jinyuan Jia; Jiaheng Zhang

arXiv:2401.16820·cs.CR·January 29, 2025·5 cites

Provably Robust Multi-bit Watermarking for AI-generated Text

Wenjie Qu, Wengrui Zheng, Tianyang Tao, Dong Yin, Yanze Jiang, Zhihua, Tian, Wei Zou, Jinyuan Jia, Jiaheng Zhang

PDF

Open Access

TL;DR

This paper presents a new provably robust watermarking method for AI-generated text that significantly improves accuracy and robustness in content source tracing, enabling reliable identification of the source even after text edits.

Contribution

The paper introduces a novel watermarking technique based on pseudo-random segment assignment, enhancing robustness and accuracy over existing methods for LLM-generated text.

Findings

01

Achieves 97.6% match rate for 20-bit messages in 200-token texts

02

Outperforms existing baselines in accuracy and robustness

03

Tolerates an average edit distance of 17 per paragraph

Abstract

Large Language Models (LLMs) have demonstrated remarkable capabilities of generating texts resembling human language. However, they can be misused by criminals to create deceptive content, such as fake news and phishing emails, which raises ethical concerns. Watermarking is a key technique to address these concerns, which embeds a message (e.g., a bit string) into a text generated by an LLM. By embedding the user ID (represented as a bit string) into generated texts, we can trace generated texts to the user, known as content source tracing. The major limitation of existing watermarking techniques is that they achieve sub-optimal performance for content source tracing in real-world scenarios. The reason is that they cannot accurately or efficiently extract a long message from a generated text. We aim to address the limitations. In this work, we introduce a new watermarking method for…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Steganography and Watermarking Techniques · Chaos-based Image/Signal Encryption · Vehicle License Plate Recognition