Optimal Estimation of Watermark Proportions in Hybrid AI-Human Texts

Xiang Li; Garrett Wen; Weiqing He; Jiayuan Wu; Qi Long; Weijie J. Su

arXiv:2506.22343·stat.ML·June 30, 2025

Optimal Estimation of Watermark Proportions in Hybrid AI-Human Texts

Xiang Li, Garrett Wen, Weiqing He, Jiayuan Wu, Qi Long, Weijie J. Su

PDF

Open Access

TL;DR

This paper develops optimal methods for estimating the proportion of watermarked content in texts that are a mixture of human and AI-generated text, addressing a key challenge in watermark detection.

Contribution

It introduces identifiable estimation techniques for watermark proportions in mixed texts using pivotal statistics, with theoretical bounds and practical algorithms.

Findings

01

Estimators achieve minimax optimality.

02

High accuracy on synthetic and real data.

03

Applicable to popular watermarking schemes.

Abstract

Text watermarks in large language models (LLMs) are an increasingly important tool for detecting synthetic text and distinguishing human-written content from LLM-generated text. While most existing studies focus on determining whether entire texts are watermarked, many real-world scenarios involve mixed-source texts, which blend human-written and watermarked content. In this paper, we address the problem of optimally estimating the watermark proportion in mixed-source texts. We cast this problem as estimating the proportion parameter in a mixture model based on \emph{pivotal statistics}. First, we show that this parameter is not even identifiable in certain watermarking schemes, let alone consistently estimable. In stark contrast, for watermarking methods that employ continuous pivotal statistics for detection, we demonstrate that the proportion parameter is identifiable under mild…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsGenerative Adversarial Networks and Image Synthesis · Advanced Steganography and Watermarking Techniques · Handwritten Text Recognition Techniques