Block-wise Codeword Embedding for Reliable Multi-bit Text Watermarking

Joeun Kim; HoEun Kim; Dongsup Jin; and Young-Sik Kim

arXiv:2605.00348·cs.CR·May 4, 2026

Block-wise Codeword Embedding for Reliable Multi-bit Text Watermarking

Joeun Kim, HoEun Kim, Dongsup Jin, and Young-Sik Kim

PDF

TL;DR

This paper introduces BREW, a novel multi-bit watermarking framework for LLMs that significantly improves reliability by shifting from decoding to designated verification, reducing false positives.

Contribution

BREW presents a two-stage verification process that overcomes false positive issues in existing ECC-based watermarking methods, enabling reliable detection under local edits.

Findings

01

BREW achieves a TPR of 0.965 and an FPR of 0.02 under 10% synonym substitution.

02

Existing ECC-based extractors suffer from catastrophic false positives, which BREW effectively mitigates.

03

The framework is model-agnostic and theoretically grounded, scalable for forensic deployment.

Abstract

Recent multi-bit watermarking methods for large language models (LLMs) prioritize capacity over reliability, often conflating decoding with detection. Our analysis reveals that existing ECC-based extractors suffer from catastrophic false positive rates (FPR), and applying rejection thresholds merely collapses detection sensitivity (TPR) to random guessing. To resolve this structural limitation, we propose \textbf{BREW} (Block-wise Reliable Embedding for Watermarking), a framework shifting the paradigm to \emph{designated verification}. BREW employs a two-stage mechanism: (i) \textbf{blind message estimation} via independent block voting, followed by (ii) \textbf{window-shifting verification} that rigorously validates the payload against local edits. Experiments demonstrate that BREW achieves a TPR of 0.965 with an FPR of 0.02 under 10\% synonym substitution, demonstrating that the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.