Improved Unbiased Watermark for Large Language Models

Ruibo Chen; Yihan Wu; Junfeng Guo; Heng Huang

arXiv:2502.11268·cs.CL·August 7, 2025

Improved Unbiased Watermark for Large Language Models

Ruibo Chen, Yihan Wu, Junfeng Guo, Heng Huang

PDF

Open Access

TL;DR

This paper introduces MCmark, a novel unbiased watermarking method for large language models that improves detectability and robustness without affecting text quality, aiding in authenticating AI-generated content.

Contribution

We propose MCmark, a multi-channel unbiased watermarking technique that significantly enhances detection and robustness over existing methods in large language models.

Findings

01

Over 10% improvement in detectability compared to state-of-the-art methods

02

Preserves original language model distribution

03

Demonstrates robustness across various models

Abstract

As artificial intelligence surpasses human capabilities in text generation, the necessity to authenticate the origins of AI-generated content has become paramount. Unbiased watermarks offer a powerful solution by embedding statistical signals into language model-generated text without distorting the quality. In this paper, we introduce MCmark, a family of unbiased, Multi-Channel-based watermarks. MCmark works by partitioning the model's vocabulary into segments and promoting token probabilities within a selected segment based on a watermark key. We demonstrate that MCmark not only preserves the original distribution of the language model but also offers significant improvements in detectability and robustness over existing unbiased watermarks. Our experiments with widely-used language models demonstrate an improvement in detectability of over 10% using MCmark, compared to existing…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Steganography and Watermarking Techniques · Generative Adversarial Networks and Image Synthesis · Handwritten Text Recognition Techniques