Every Bit, Everywhere, All at Once: A Binomial Multibit LLM Watermark
Thibaud Gloaguen, Robin Staab, Mark Vero, Martin Vechev

TL;DR
This paper introduces a novel binomial encoding approach for multibit LLM watermarking, enabling more complex payloads with higher accuracy and robustness, and proposes new evaluation metrics for watermark effectiveness.
Contribution
It presents a fundamentally new binomial encoding method for multibit watermarking in LLMs, improving message accuracy and robustness over existing baselines.
Findings
Achieves superior message accuracy and robustness compared to 8 baselines.
Effectively encodes larger payloads with low distortion.
Introduces per-bit confidence scoring as a practical evaluation metric.
Abstract
With LLM watermarking already being deployed commercially, practical applications increasingly require multibit watermarks that encode more complex payloads, such as user IDs or timestamps, into the generated text. In this work, we propose a fundamentally new approach for multibit watermarking: introducing binomial encoding to directly encode every bit of the payload at every token position. We complement our approach with a stateful encoder that during generation dynamically redirects encoding pressure toward underencoded bits. Our evaluation against 8 baselines on up to 64-bit payloads shows that our scheme achieves superior message accuracy and robustness, with the gap to baseline methods widening in more relevant settings (i.e., large payloads and low-distortion regimes). At the same time, we challenge prior works' evaluation metrics, highlighting their lack of practical insights,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
