WaterMod: Modular Token-Rank Partitioning for Probability-Balanced LLM Watermarking
Shinwoo Park, Hyejin Park, Hyeseon Ahn, Yo-Sub Han

TL;DR
WaterMod introduces a probability-aware modular token partitioning method for LLM watermarking that balances imperceptibility and detectability, enabling both binary and multi-bit provenance verification without sacrificing language generation quality.
Contribution
The paper proposes WaterMod, a novel modular token-rank partitioning scheme that improves watermark robustness and payload capacity while preserving language fluency.
Findings
WaterMod achieves high detection accuracy across tasks.
It maintains language quality comparable to unwatermarked models.
Supports both zero-bit and multi-bit watermarking regimes.
Abstract
Large language models now draft news, legal analyses, and software code with human-level fluency. At the same time, regulations such as the EU AI Act mandate that each synthetic passage carry an imperceptible, machine-verifiable mark for provenance. Conventional logit-based watermarks satisfy this requirement by selecting a pseudorandom green vocabulary at every decoding step and boosting its logits, yet the random split can exclude the highest-probability token and thus erode fluency. WaterMod mitigates this limitation through a probability-aware modular rule. The vocabulary is first sorted in descending model probability; the resulting ranks are then partitioned by the residue rank mod k, which distributes adjacent-and therefore semantically similar-tokens across different classes. A fixed bias of small magnitude is applied to one selected class. In the zero-bit setting (k=2), an…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Taxonomy
TopicsAdversarial Robustness in Machine Learning · Physical Unclonable Functions (PUFs) and Hardware Security · Scientific Computing and Data Management
