BiMark: Unbiased Multilayer Watermarking for Large Language Models

Xiaoyan Feng; He Zhang; Yanjun Zhang; Leo Yu Zhang; Shirui Pan

arXiv:2506.21602·cs.CL·August 26, 2025

BiMark: Unbiased Multilayer Watermarking for Large Language Models

Xiaoyan Feng, He Zhang, Yanjun Zhang, Leo Yu Zhang, Shirui Pan

PDF

Open Access 1 Video

TL;DR

BiMark introduces an innovative multilayer watermarking framework for large language models that balances text quality, detection robustness, and message capacity, outperforming existing methods in extraction rate and maintaining downstream task performance.

Contribution

The paper presents BiMark, a novel multilayer watermarking approach with unbiased reweighting and multi-bit encoding, advancing the state-of-the-art in LLM text authentication.

Findings

01

Up to 30% higher extraction rates for short texts.

02

Lower perplexity indicates better text quality.

03

Maintains performance on downstream tasks like summarization and translation.

Abstract

Recent advances in Large Language Models (LLMs) have raised urgent concerns about LLM-generated text authenticity, prompting regulatory demands for reliable identification mechanisms. Although watermarking offers a promising solution, existing approaches struggle to simultaneously achieve three critical requirements: text quality preservation, model-agnostic detection, and message embedding capacity, which are crucial for practical implementation. To achieve these goals, the key challenge lies in balancing the trade-off between text quality preservation and message embedding capacity. To address this challenge, we propose BiMark, a novel watermarking framework that achieves these requirements through three key innovations: (1) a bit-flip unbiased reweighting mechanism enabling model-agnostic detection, (2) a multilayer architecture enhancing detectability without compromising generation…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

BiMark: Unbiased Multilayer Watermarking for Large Language Models· slideslive

Taxonomy

TopicsAdversarial Robustness in Machine Learning · Advanced Malware Detection Techniques · Big Data and Digital Economy