Multi-Designated Detector Watermarking for Language Models

Zhengan Huang; Gongxian Zeng; Xin Mu; Yu Wang; Yue Yu

arXiv:2409.17518·cs.CR·October 2, 2024·2 cites

Multi-Designated Detector Watermarking for Language Models

Zhengan Huang, Gongxian Zeng, Xin Mu, Yu Wang, Yue Yu

PDF

Open Access

TL;DR

This paper introduces multi-designated detector watermarking for large language models, enabling secure, high-quality watermarked outputs detectable by specific detectors, with optional claimability for ownership assertion.

Contribution

It formalizes the concept of multi-designated detector watermarking, proposes a framework using multi-designated verifier signatures, and introduces claimability as a new security feature.

Findings

01

Effective watermark detection by designated detectors

02

No perceptible quality degradation in watermarked outputs

03

Flexible implementation with satisfactory performance

Abstract

In this paper, we initiate the study of \emph{multi-designated detector watermarking (MDDW)} for large language models (LLMs). This technique allows model providers to generate watermarked outputs from LLMs with two key properties: (i) only specific, possibly multiple, designated detectors can identify the watermarks, and (ii) there is no perceptible degradation in the output quality for ordinary users. We formalize the security definitions for MDDW and present a framework for constructing MDDW for any LLM using multi-designated verifier signatures (MDVS). Recognizing the significant economic value of LLM outputs, we introduce claimability as an optional security feature for MDDW, enabling model providers to assert ownership of LLM outputs within designated-detector settings. To support claimable MDDW, we propose a generic transformation converting any MDVS to a claimable MDVS. Our…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Steganography and Watermarking Techniques · Digital Media Forensic Detection · Face recognition and analysis