Building Intelligence Identification System via Large Language Model Watermarking: A Survey and Beyond
Xuhong Wang, Haoyu Jiang, Yi Yu, Jingru Yu, Yilun Lin, Ping Yi,, Yingchun Wang, Yu Qiao, Li Li, Fei-Yue Wang

TL;DR
This paper surveys the use of watermarking technology for identifying and protecting large language models, proposing a mathematical framework and evaluating performance metrics to enhance security and management.
Contribution
It introduces a mutual information-based framework for LLM watermarking and provides a comprehensive analysis of current methods and challenges in the field.
Findings
Proposed a systematic mutual information framework for watermarking
Evaluated performance metrics reflecting participant preferences
Identified key challenges and future directions in LLM watermarking
Abstract
Large Language Models (LLMs) are increasingly integrated into diverse industries, posing substantial security risks due to unauthorized replication and misuse. To mitigate these concerns, robust identification mechanisms are widely acknowledged as an effective strategy. Identification systems for LLMs now rely heavily on watermarking technology to manage and protect intellectual property and ensure data security. However, previous studies have primarily concentrated on the basic principles of algorithms and lacked a comprehensive analysis of watermarking theory and practice from the perspective of intelligent identification. To bridge this gap, firstly, we explore how a robust identity recognition system can be effectively implemented and managed within LLMs by various participants using watermarking technology. Secondly, we propose a mathematical framework based on mutual information…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Steganography and Watermarking Techniques · Handwritten Text Recognition Techniques · Vehicle License Plate Recognition
