Unsupervised Model Tree Heritage Recovery
Eliahu Horwitz, Asaf Shul, Yedid Hoshen

TL;DR
This paper introduces an unsupervised method to recover the hierarchical heritage of neural network models by analyzing their weights, enabling the construction of Model Trees to clarify model lineage and address intellectual property concerns.
Contribution
We propose a novel unsupervised approach to reconstruct Model Trees from neural network weights, revealing model lineage without relying on metadata or labels.
Findings
Successfully reconstructs complex Model Trees from weights
Identifies properties of weights that encode model relationships
Formulates the problem as finding a directed minimal spanning tree
Abstract
The number of models shared online has recently skyrocketed, with over one million public models available on Hugging Face. Sharing models allows other users to build on existing models, using them as initialization for fine-tuning, improving accuracy, and saving compute and energy. However, it also raises important intellectual property issues, as fine-tuning may violate the license terms of the original model or that of its training data. A Model Tree, i.e., a tree data structure rooted at a foundation model and having directed edges between a parent model and other models directly fine-tuned from it (children), would settle such disputes by making the model heritage explicit. Unfortunately, current models are not well documented, with most model metadata (e.g., "model cards") not providing accurate information about heritage. In this paper, we introduce the task of Unsupervised Model…
Peer Reviews
Decision·ICLR 2025 Poster
Originality: This paper addresses a new problem: estimating the relationship between models and their fine-tuned versions. However, the significance of this problem for open models is debatable; see the weakness for the detailed comments. Simple approach: The proposed approach based on the distance of model weights is simple. But this is based on a well-known fact that fine-tuning makes small weight changes. Writing: The clarity is mixed; some parts are easy to follow, but certain importan
Limitation 1: The proposed approach can only handle open models, as it relies on model weights. For important open models that have been fine-tuned, information about the pretrained models is often available at the time of release. For models without such information, one can infer relationships based on weight distance. However, it is unclear why this information is needed for all released models. Limitation 2: The proposed approach constructs the model tree based on the weight distances betwe
* Shedding light on the relation of models, in particular, in the LLM regime is crucial. * The numerics are convincing.
* Due to the importance of such a method for legal aspects, some theoretical underpinning should be given, which is currently missing. * The running time of the method is not provided.
Please see the "Questions" section.
Please see the "Questions" section.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsForest ecology and management · Tree-ring climate responses · Conservation, Biodiversity, and Resource Management
MethodsDiffusion · LLaMA
