Unsupervised Model Tree Heritage Recovery

Eliahu Horwitz; Asaf Shul; Yedid Hoshen

arXiv:2405.18432·cs.LG·April 16, 2025

Unsupervised Model Tree Heritage Recovery

Eliahu Horwitz, Asaf Shul, Yedid Hoshen

PDF

Open Access 1 Repo 3 Reviews

TL;DR

This paper introduces an unsupervised method to recover the hierarchical heritage of neural network models by analyzing their weights, enabling the construction of Model Trees to clarify model lineage and address intellectual property concerns.

Contribution

We propose a novel unsupervised approach to reconstruct Model Trees from neural network weights, revealing model lineage without relying on metadata or labels.

Findings

01

Successfully reconstructs complex Model Trees from weights

02

Identifies properties of weights that encode model relationships

03

Formulates the problem as finding a directed minimal spanning tree

Abstract

The number of models shared online has recently skyrocketed, with over one million public models available on Hugging Face. Sharing models allows other users to build on existing models, using them as initialization for fine-tuning, improving accuracy, and saving compute and energy. However, it also raises important intellectual property issues, as fine-tuning may violate the license terms of the original model or that of its training data. A Model Tree, i.e., a tree data structure rooted at a foundation model and having directed edges between a parent model and other models directly fine-tuned from it (children), would settle such disputes by making the model heritage explicit. Unfortunately, current models are not well documented, with most model metadata (e.g., "model cards") not providing accurate information about heritage. In this paper, we introduce the task of Unsupervised Model…

Peer Reviews

Decision·ICLR 2025 Poster

Reviewer 01Rating 5Confidence 3

Strengths

Originality: This paper addresses a new problem: estimating the relationship between models and their fine-tuned versions. However, the significance of this problem for open models is debatable; see the weakness for the detailed comments. Simple approach: The proposed approach based on the distance of model weights is simple. But this is based on a well-known fact that fine-tuning makes small weight changes. Writing: The clarity is mixed; some parts are easy to follow, but certain importan

Weaknesses

Limitation 1: The proposed approach can only handle open models, as it relies on model weights. For important open models that have been fine-tuned, information about the pretrained models is often available at the time of release. For models without such information, one can infer relationships based on weight distance. However, it is unclear why this information is needed for all released models. Limitation 2: The proposed approach constructs the model tree based on the weight distances betwe

Reviewer 02Rating 8Confidence 3

Strengths

* Shedding light on the relation of models, in particular, in the LLM regime is crucial. * The numerics are convincing.

Weaknesses

* Due to the importance of such a method for legal aspects, some theoretical underpinning should be given, which is currently missing. * The running time of the method is not provided.

Reviewer 03Rating 8Confidence 2

Strengths

Please see the "Questions" section.

Weaknesses

Please see the "Questions" section.

Code & Models

Repositories

eliahuhorwitz/MoTHer
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsForest ecology and management · Tree-ring climate responses · Conservation, Biodiversity, and Resource Management

MethodsDiffusion · LLaMA