Inter-model Interpretability: Self-supervised Models as a Case Study

Ahmad Mustapha; Wael Khreich; Wes Masri

arXiv:2207.11837·cs.CV·August 2, 2022

Inter-model Interpretability: Self-supervised Models as a Case Study

Ahmad Mustapha, Wael Khreich, Wes Masri

PDF

Open Access

TL;DR

This paper introduces inter-model interpretability using a new embedding space to analyze how self-supervised models relate based on learned visual concepts, revealing insights across tasks and datasets.

Contribution

It proposes a novel inter-model interpretability method using Learned Concepts Embedding to analyze relationships among self-supervised models based on learned concepts.

Findings

01

Models cluster into three categories based on learned concepts.

02

Different tasks require distinct types of visual concepts.

03

The approach reveals how models relate and complement each other.

Abstract

Since early machine learning models, metrics such as accuracy and precision have been the de facto way to evaluate and compare trained models. However, a single metric number doesn't fully capture the similarities and differences between models, especially in the computer vision domain. A model with high accuracy on a certain dataset might provide a lower accuracy on another dataset, without any further insights. To address this problem we build on a recent interpretability technique called Dissect to introduce \textit{inter-model interpretability}, which determines how models relate or complement each other based on the visual concepts they have learned (such as objects and materials). Towards this goal, we project 13 top-performing self-supervised models into a Learned Concepts Embedding (LCE) space that reveals proximities among models from the perspective of learned concepts. We…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsExplainable Artificial Intelligence (XAI) · Machine Learning and Data Classification · Adversarial Robustness in Machine Learning