Learning on Model Weights using Tree Experts
Eliahu Horwitz, Bar Cavia, Jonathan Kahana, Yedid Hoshen

TL;DR
This paper introduces ProbeX, a lightweight probing method that leverages the structure of Model Trees to classify models and predict their training data categories using only model weights, enabling zero-shot model search.
Contribution
The paper presents ProbeX, a novel probing technique designed specifically for single-layer model weights, and demonstrates its effectiveness in classifying models and their training data.
Findings
ProbeX accurately predicts training data categories from model weights.
Model weights within the same Model Tree show less nuisance variation.
ProbeX enables zero-shot model classification via weight-language embedding.
Abstract
The number of publicly available models is rapidly increasing, yet most remain undocumented. Users looking for suitable models for their tasks must first determine what each model does. Training machine learning models to infer missing documentation directly from model weights is challenging, as these weights often contain significant variation unrelated to model functionality (denoted nuisance). Here, we identify a key property of real-world models: most public models belong to a small set of Model Trees, where all models within a tree are fine-tuned from a common ancestor (e.g., a foundation model). Importantly, we find that within each tree there is less nuisance variation between models. Concretely, while learning across Model Trees requires complex architectures, even a linear classifier trained on a single model layer often works within trees. While effective, these linear…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
- 🤗ProbeX/Model-J__ResNet__model_idx_0182model· 5 dl5 dl
- 🤗ProbeX/Model-J__ResNet__model_idx_0505model· 6 dl6 dl
- 🤗ProbeX/Model-J__ResNet__model_idx_0603model· 5 dl5 dl
- 🤗ProbeX/Model-J__ResNet__model_idx_0610model· 5 dl5 dl
- 🤗ProbeX/Model-J__ResNet__model_idx_0294model· 6 dl6 dl
- 🤗ProbeX/Model-J__ResNet__model_idx_0539model· 4 dl4 dl
- 🤗ProbeX/Model-J__ResNet__model_idx_0516model· 6 dl6 dl
- 🤗ProbeX/Model-J__ResNet__model_idx_0716model· 6 dl6 dl
- 🤗ProbeX/Model-J__ResNet__model_idx_0619model· 4 dl4 dl
- 🤗ProbeX/Model-J__ResNet__model_idx_0599model· 5 dl5 dl
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Natural Language Processing Techniques · Bayesian Modeling and Causal Inference
MethodsDiffusion · Sparse Evolutionary Training
