FDLLM: A Dedicated Detector for Black-Box LLMs Fingerprinting
Zhiyuan Fu, Junfan Chen, Lan Zhang, Ting Yang, Jun Niu, Hongyu Sun, Ruidong Li, Peng Liu, Jice Wang, Fannv He, Qiuling Yue, Yuqing Zhang

TL;DR
FDLLM introduces a novel fingerprinting method using parameter-efficient LoRA fine-tuning, supported by a comprehensive bilingual dataset, achieving high accuracy and robustness in identifying proprietary and open-source LLMs.
Contribution
The paper presents FDLLM, a new LLM fingerprinting approach leveraging LoRA fine-tuning and introduces FD-Dataset, a large bilingual benchmark for model identification.
Findings
FDLLM outperforms existing baselines with a 22.1% higher Macro F1 score.
Achieves 95% accuracy on unseen models, demonstrating strong generalization.
Maintains robustness against adversarial attacks, reducing attack success rate significantly.
Abstract
Large Language Models (LLMs) are rapidly transforming the landscape of digital content creation. However, the prevalent black-box Application Programming Interface (API) access to many LLMs introduces significant challenges in accountability, governance, and security. LLM fingerprinting, which aims to identify the source model by analyzing statistical and stylistic features of generated text, offers a potential solution. Current progress in this area is hindered by a lack of dedicated datasets and the need for efficient, practical methods that are robust against adversarial manipulations. To address these challenges, we introduce FD-Dataset, a comprehensive bilingual fingerprinting benchmark comprising 90,000 text samples from 20 famous proprietary and open-source LLMs. Furthermore, we present FDLLM, a novel fingerprinting method that leverages parameter-efficient Low-Rank Adaptation…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsHandwritten Text Recognition Techniques · Authorship Attribution and Profiling · Digital and Cyber Forensics
MethodsFocus
