Ghost in the Transformer: Detecting Model Reuse with Invariant Spectral Signatures
Suqing Wang, Ziyang Ma, Li Xinyi, Zuchao Li

TL;DR
GhostSpec is a novel, data-free method that uses spectral signatures of attention weights to verify the lineage of large language models, ensuring intellectual property protection and model provenance.
Contribution
It introduces GhostSpec, a lightweight, robust, and non-invasive spectral signature technique for verifying LLM lineage without access to training data.
Findings
Robust to fine-tuning, pruning, and adversarial transformations
Efficient and requires minimal computational overhead
Effective in real-world model verification scenarios
Abstract
Large Language Models (LLMs) are widely adopted, but their high training cost leads many developers to fine-tune existing open-source models. While most adhere to open-source licenses, some falsely claim original training despite clear derivation from public models, raising pressing concerns about intellectual property protection and the need to verify model provenance. In this paper, we propose GhostSpec, a lightweight yet effective method for verifying LLM lineage without access to training data or modification of model behavior. Our approach constructs compact and robust fingerprints by applying singular value decomposition (SVD) to invariant products of internal attention weight matrices. Unlike watermarking or output-based methods, GhostSpec is fully data-free, non-invasive, and computationally efficient. Extensive experiments show it is robust to fine-tuning, pruning, expansion,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Taxonomy
TopicsAdversarial Robustness in Machine Learning · Scientific Computing and Data Management · Advanced Malware Detection Techniques
