TL;DR
This survey comprehensively reviews methods for protecting large language models' copyrights, focusing on model fingerprinting, and clarifies the relationships among watermarking techniques, highlighting challenges and future directions.
Contribution
It is the first systematic survey to unify text watermarking, model watermarking, and fingerprinting, and introduces new techniques for fingerprint transfer and removal.
Findings
Provides a unified framework for watermarking and fingerprinting techniques.
Systematically categorizes and compares existing model fingerprinting methods.
Introduces novel techniques for fingerprint transfer and removal.
Abstract
Copyright protection for large language models is of critical importance, given their substantial development costs, proprietary value, and potential for misuse. Existing surveys have predominantly focused on techniques for tracing LLM-generated content-namely, text watermarking-while a systematic exploration of methods for protecting the models themselves (i.e., model watermarking and model fingerprinting) remains absent. Moreover, the relationships and distinctions among text watermarking, model watermarking, and model fingerprinting have not been comprehensively clarified. This work presents a comprehensive survey of the current state of LLM copyright protection technologies, with a focus on model fingerprinting, covering the following aspects: (1) clarifying the conceptual connection from text watermarking to model watermarking and fingerprinting, and adopting a unified terminology…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
