Inter-model Interpretability: Self-supervised Models as a Case Study
Ahmad Mustapha, Wael Khreich, Wes Masri

TL;DR
This paper introduces inter-model interpretability using a new embedding space to analyze how self-supervised models relate based on learned visual concepts, revealing insights across tasks and datasets.
Contribution
It proposes a novel inter-model interpretability method using Learned Concepts Embedding to analyze relationships among self-supervised models based on learned concepts.
Findings
Models cluster into three categories based on learned concepts.
Different tasks require distinct types of visual concepts.
The approach reveals how models relate and complement each other.
Abstract
Since early machine learning models, metrics such as accuracy and precision have been the de facto way to evaluate and compare trained models. However, a single metric number doesn't fully capture the similarities and differences between models, especially in the computer vision domain. A model with high accuracy on a certain dataset might provide a lower accuracy on another dataset, without any further insights. To address this problem we build on a recent interpretability technique called Dissect to introduce \textit{inter-model interpretability}, which determines how models relate or complement each other based on the visual concepts they have learned (such as objects and materials). Towards this goal, we project 13 top-performing self-supervised models into a Learned Concepts Embedding (LCE) space that reveals proximities among models from the perspective of learned concepts. We…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsExplainable Artificial Intelligence (XAI) · Machine Learning and Data Classification · Adversarial Robustness in Machine Learning
