MirrorMark: Generalizable Mirrored Sampling for Multi-bit LLM Watermarking
Ya Jiang, Massieh Kordi Boroujeny, Surender Suresh Kumar, Kai Zeng

TL;DR
MirrorMark introduces a novel, generalizable method for multi-bit LLM watermarking that preserves text quality and enhances detectability through a mapping-centric approach and a balanced scheduling mechanism.
Contribution
It proposes MirrorMark, a mapping-centric watermarking technique that separates symbol mapping from sampling, enabling robust multi-bit watermarking without distorting the generation distribution.
Findings
MirrorMark achieves high detectability and bit accuracy.
It maintains text quality comparable to non-watermarked outputs.
Theoretical EER analyses support its effectiveness.
Abstract
As large language models (LLMs) become integral to applications such as question answering and content creation, reliable content attribution has become increasingly important. Watermarking is a promising approach, but most existing methods either provide only binary signals or achieve multi-bit embedding by distorting the generation distribution. We propose MirrorMark, a generalizable mapping-centric approach for multi-bit LLM watermarking. MirrorMark separates the symbol mapping rule from the base watermarking sampler and maps each symbol to a mod-1 mirroring transformation of a detector-reproducible pseudorandom object, such as sampling values or permutation ranks. A binary-tokenizer analysis shows that complementary mappings yield larger matched--mismatched score gaps than independent-key or shift-based mappings. When composed with a distortion-free base sampler, MirrorMark…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
