Ghost in the Transformer: Detecting Model Reuse with Invariant Spectral Signatures

Suqing Wang; Ziyang Ma; Li Xinyi; Zuchao Li

arXiv:2511.06390·cs.CR·December 9, 2025

Ghost in the Transformer: Detecting Model Reuse with Invariant Spectral Signatures

Suqing Wang, Ziyang Ma, Li Xinyi, Zuchao Li

PDF

Open Access 1 Video

TL;DR

GhostSpec is a novel, data-free method that uses spectral signatures of attention weights to verify the lineage of large language models, ensuring intellectual property protection and model provenance.

Contribution

It introduces GhostSpec, a lightweight, robust, and non-invasive spectral signature technique for verifying LLM lineage without access to training data.

Findings

01

Robust to fine-tuning, pruning, and adversarial transformations

02

Efficient and requires minimal computational overhead

03

Effective in real-world model verification scenarios

Abstract

Large Language Models (LLMs) are widely adopted, but their high training cost leads many developers to fine-tune existing open-source models. While most adhere to open-source licenses, some falsely claim original training despite clear derivation from public models, raising pressing concerns about intellectual property protection and the need to verify model provenance. In this paper, we propose GhostSpec, a lightweight yet effective method for verifying LLM lineage without access to training data or modification of model behavior. Our approach constructs compact and robust fingerprints by applying singular value decomposition (SVD) to invariant products of internal attention weight matrices. Unlike watermarking or output-based methods, GhostSpec is fully data-free, non-invasive, and computationally efficient. Extensive experiments show it is robust to fine-tuning, pruning, expansion,…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

Ghost in the Transformer: Detecting Model Reuse with Invariant Spectral Signatures· underline

Taxonomy

TopicsAdversarial Robustness in Machine Learning · Scientific Computing and Data Management · Advanced Malware Detection Techniques