DERMARK: A Dynamic, Efficient and Robust Multi-bit Watermark for Large Language Models
Qihao Lin, Chen Tang, Lan zhang, Junyang zhang, Xiangyang Li

TL;DR
DERMARK introduces a novel multi-bit watermarking technique for large language models that dynamically segments text during inference, significantly improving efficiency, robustness, and capacity over existing methods.
Contribution
The paper presents DERMARK, a dynamic multi-bit watermarking approach that optimally segments text based on a formal inequality, enhancing robustness and efficiency in LLM watermarking.
Findings
Reduces tokens per watermark bit by 25%
Halves watermark embedding time
Maintains high robustness against text modifications
Abstract
As large language models (LLMs) grow more powerful, concerns over copyright infringement of LLM-generated texts have intensified. LLM watermarking has been proposed to trace unauthorized redistribution or resale of generated content by embedding identifiers within the text. Existing approaches primarily rely on one-bit watermarking, which only verifies whether a text was generated by a specific LLM. In contrast, multi-bit watermarking encodes richer information, enabling the identification of the specific LLM and user involved in generated or distributed content. However, current multi-bit methods directly embed the watermark into the text without considering its watermark capacity, which can result in failures, especially in low-entropy texts. In this paper, we analyze that the watermark embedding follows a normal distribution. We then derive a formal inequality to optimally segment…
Peer Reviews
Decision·Submitted to ICLR 2026
1. Principled theoretical framing (Poisson-binomial → CLT → inequality) that connects token-level probabilities to required segment length. 2. Practical algorithm: online segmentation during inference with negligible extra compute compared to baseline multi-bit methods. Reported embedding overhead is near zero and extraction is efficient enough for practice. 3.Strong empirical gains on tokens-per-bit and robustness to small insertion/deletion attacks across two model families. Table 1 + figu
1. CLT approximation may be unreliable when segments are short (the very regime the method targets), and the paper lacks finite-sample error bounds or bootstrap-style corrections. 2. many heuristics (λ smoothing, β weighting, iterative ϵ updates). The paper reports defaults but more ablations on hyperparameter sensitivity and cross-domain robustness (beyond news-like prompts) would be helpful. 3. the authors justify using Balance-Marking as SOTA and critique MPAC; still, including more rec
1. DERMARK adaptively determines segment boundaries in real time based on token-level statistics, achieving 2–4 fewer tokens per bit at the same detection rate. This dynamic rule substantially enhances embedding efficiency without retraining. 2. The embedding complexity is linear (O(N)) and extraction is O($kL^2$), and test for a large model (LLaMA-2-70B). The method is fully plug-and-play, requiring no fine-tuning or architectural modification. 3. The inclusion of perplexity (PPL) experiments
1. While Appendix C discusses MPAC (NAACL 2024) conceptually, the paper still provides no quantitative comparison with recent multi-bit watermarking approaches. Furthermore, I think the method can be extended to multi-bit watermarking methods such as MPAC. A discussion of this point would greatly strengthen the paper. 2. The robustness tests remain restricted to random insertion/deletion attacks. No experiments address paraphrasing, shuffling, gradient-based, or LLM-assisted removal attacks, w
1. The paper explains why multi-bit watermarking (beyond one-bit detection) is needed for fine-grained attribution (LLM/user) and why fixed-length segmentation can fail, especially on low-entropy text. 2. Derives an inequality from a CLT-style analysis that treats aligned-token proportion as approximately normal; this enables an online, per-bit stopping rule during generation. 3. For matched detection rates, DERMARK uses fewer tokens per embedded bit, with further gains on low-entropy subsets.
1. The experimental setup is outdated. There are many new multi-bit watermarking works. However, this paper only uses one 2023 paper as a baseline. Comparing with more extensive, recent baselines will strengthen the claims. 2. Considering most application scenarios of the LLM watermark are under the chat. Evaluating the performance on long-form QA dataset and instructed models (e.g., Llama-3.1-8B-Instruct) is necessary. 3. Robustness focuses on random insert/delete at 5–10%, which is limited
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Steganography and Watermarking Techniques · Internet Traffic Analysis and Secure E-voting · Chaos-based Image/Signal Encryption
