Gradient-Based Model Fingerprinting for LLM Similarity Detection and Family Classification

Zehao Wu; Yanjie Zhao; Haoyu Wang

arXiv:2506.01631·cs.LG·July 4, 2025

Gradient-Based Model Fingerprinting for LLM Similarity Detection and Family Classification

Zehao Wu, Yanjie Zhao, Haoyu Wang

PDF

Open Access

TL;DR

This paper introduces TensorGuard, a gradient-based fingerprinting method that effectively detects similarity and classifies families of large language models without relying on training data or watermarks.

Contribution

TensorGuard provides a novel, training-data-independent fingerprinting framework for LLMs, enabling model similarity detection and family classification through gradient analysis.

Findings

01

Achieved 94% accuracy in classifying models into five families.

02

Supports widely-used safetensors format for fingerprint extraction.

03

Operates independently of training data, watermarks, or model formats.

Abstract

As Large Language Models (LLMs) become integral software components in modern applications, unauthorized model derivations through fine-tuning, merging, and redistribution have emerged as critical software engineering challenges. Unlike traditional software where clone detection and license compliance are well-established, the LLM ecosystem lacks effective mechanisms to detect model lineage and enforce licensing agreements. This gap is particularly problematic when open-source model creators, such as Meta's LLaMA, require derivative works to maintain naming conventions for attribution, yet no technical means exist to verify compliance. To fill this gap, treating LLMs as software artifacts requiring provenance tracking, we present TensorGuard, a gradient-based fingerprinting framework for LLM similarity detection and family classification. Our approach extracts model-intrinsic…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAuthorship Attribution and Profiling · Data Quality and Management · Speech Recognition and Synthesis

MethodsLLaMA · Balanced Selection · k-Means Clustering