Origin Tracer: A Method for Detecting LoRA Fine-Tuning Origins in LLMs
Hongyu Liang, Yuting Zheng, Yihan Li, Yiran Zhang, Shiyu Liang

TL;DR
The paper introduces Origin-Tracer, a novel method for verifying if an LLM was fine-tuned from a specific base model, even under obfuscation, enhancing transparency and trust in model origins.
Contribution
It presents the first formalized approach to detect fine-tuning origins and extract LoRA rank, improving robustness against obfuscation techniques.
Findings
Effectively detects fine-tuning origins under obfuscation scenarios
Successfully extracts LoRA rank during fine-tuning
Establishes new benchmarks for model verification
Abstract
As large language models (LLMs) continue to advance, their deployment often involves fine-tuning to enhance performance on specific downstream tasks. However, this customization is sometimes accompanied by misleading claims about the origins, raising significant concerns about transparency and trust within the open-source community. Existing model verification techniques typically assess functional, representational, and weight similarities. However, these approaches often struggle against obfuscation techniques, such as permutations and scaling transformations. To address this limitation, we propose a novel detection method Origin-Tracer that rigorously determines whether a model has been fine-tuned from a specified base model. This method includes the ability to extract the LoRA rank utilized during the fine-tuning process, providing a more robust verification framework. This…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMagnetic confinement fusion research
MethodsBalanced Selection
