WorldCup Sampling for Multi-bit LLM Watermarking

Yidan Wang; Yubing Ren; Yanan Cao; Li Guo

arXiv:2602.01752·cs.CL·May 11, 2026

WorldCup Sampling for Multi-bit LLM Watermarking

Yidan Wang, Yubing Ren, Yanan Cao, Li Guo

PDF

TL;DR

This paper introduces WorldCup, a novel multi-bit watermarking framework for LLMs that models sampling as a communication channel, improving robustness, quality, and capacity over prior methods.

Contribution

WorldCup models LLM sampling as a structured communication channel and uses hierarchical competition and entropy-aware modulation for improved watermarking.

Findings

01

Outperforms prior methods in capacity, robustness, and text quality.

02

Balances message detectability and decoding efficiency.

03

Provides a scalable, principled approach for multi-bit watermarking.

Abstract

As large language models (LLMs) generate increasingly human-like text, watermarking has emerged as a promising solution for reliable attribution beyond mere detection. While multi-bit watermarking enables richer provenance encoding, existing approaches typically extend zero-bit watermarking schemes by introducing static logit perturbations and counting-based decoding strategies, which can degrade text quality and compromise decoding robustness as the payload increases. In this paper, we propose WorldCup, a multi-bit watermarking framework for LLMs that models the sampling process as a structured communication channel and embeds message bits through a hierarchical competition mechanism guided by complementary signals. Moreover, WorldCup incorporates entropy-aware modulation to preserve generation quality and enables robust message recovery via confidence-aware decoding that accounts for…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.