Online LLM watermark detection via e-processes
Weijie Su, Ruodu Wang, Zinan Zhao

TL;DR
This paper introduces a unified e-process-based framework for online watermark detection in LLMs, offering anytime-valid guarantees and adaptable methods to improve detection power.
Contribution
It develops a novel, theoretically grounded approach for online watermark detection using e-processes, applicable to any sequential testing scenario with independent pivotal statistics.
Findings
The framework provides competitive detection performance.
It offers anytime-valid guarantees for online testing.
Proposed methods enhance detection power through adaptivity.
Abstract
Watermarking for large language models (LLMs) has emerged as an effective tool for distinguishing AI-generated text from human-written content. Statistically, watermark schemes induce dependence between generated tokens and a pseudo-random sequence, reducing watermark detection to a hypothesis testing problem on independence. We develop a unified framework for LLM watermark detection based on e-processes, providing anytime-valid guarantees for online testing. We propose various methods to construct empirically adaptive e-processes that can enhance the detection power. The proposed methods are applicable to any sequential testing problem where independent pivotal statistics are available. In addition, theoretical results are established to characterize the power properties of the proposed procedures. Some experiments demonstrate that the proposed framework achieves competitive…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
