WorldCup Sampling for Multi-bit LLM Watermarking
Yidan Wang, Yubing Ren, Yanan Cao, Li Guo

TL;DR
This paper introduces WorldCup, a novel multi-bit watermarking framework for LLMs that models sampling as a communication channel, improving robustness, quality, and capacity over prior methods.
Contribution
WorldCup models LLM sampling as a structured communication channel and uses hierarchical competition and entropy-aware modulation for improved watermarking.
Findings
Outperforms prior methods in capacity, robustness, and text quality.
Balances message detectability and decoding efficiency.
Provides a scalable, principled approach for multi-bit watermarking.
Abstract
As large language models (LLMs) generate increasingly human-like text, watermarking has emerged as a promising solution for reliable attribution beyond mere detection. While multi-bit watermarking enables richer provenance encoding, existing approaches typically extend zero-bit watermarking schemes by introducing static logit perturbations and counting-based decoding strategies, which can degrade text quality and compromise decoding robustness as the payload increases. In this paper, we propose WorldCup, a multi-bit watermarking framework for LLMs that models the sampling process as a structured communication channel and embeds message bits through a hierarchical competition mechanism guided by complementary signals. Moreover, WorldCup incorporates entropy-aware modulation to preserve generation quality and enables robust message recovery via confidence-aware decoding that accounts for…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
